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SECRETED PROTEINS AND POLYNUCLEOTIDES ENCODING THEM 

15 

This is a continuation-in-part of Ser. No. 60/XXX,XXX, filed December 13, 1996 
(converted to provisional application from non-provisional application Ser. No. 
08/766,263), which is incorporated by reference herein. 

20 FIELD OF THE INVENTION 

The present invention provides novel polynucleotides and proteins encoded by 
such polynucleotides, along with therapeutic, diagnostic and research utilities for these 
polynucleotides and proteins. 

25 BACKGROUND OF THE INVENTION 

Technology aimed at the discovery of protein factors (including e.g., cytokines, 
such as lymphokines, interferons, CSFs and interleukins) has matured rapidly over the 
past decade. The now routine hybridization cloning and expression cloning techniques 
done novel polynucleotides "directly" in the sense that they rely on information directly 

3 0 related to the discovered protein (i.e., partial DNA/amino acid sequence of the protein 
in the case of hybridization cloning; activity of the protein in the case of expression 
cloning). More recent "indirect" cloning techniques such as signal sequence cloning, which 
isolates DNA sequences based on the presence of a now well-recognized secretory leader 
sequence motif, as well as various PCR-based or low stringency hybridization cloning 

3 5 techniques, have advanced the state of the art by making available large numbers of 
DNA/amino acid sequences for proteins that are known to have biological activity by 
virtue of their secreted nature in the case of leader sequence cloning, or by virtue of the 
cell or tissue source in the case of PCR-based techniques. It is to these proteins and the 
polynucleotides encoding them that the present invention is directed. 

40 
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SUMMARY OF THE INVENTION 
In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

5 NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 22 to nucleotide 462; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done AJ1_1 deposited under accession number 

10 ATCC 98278; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AJ1_1 deposited under accession number ATCC 98278; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AJ1_1 deposited under accession number ATCC 

15 98278; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AJ1_1 deposited under accession number ATCC 98278; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

20 (h) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:2 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
25 of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:l from nucleotide 22 to nucleotide 462; the nucleotide sequence of the full-length 
3 0 protein coding sequence of clone AJ1_1 deposited under accession number ATCC 98278; 
or the nucleotide sequence of the mature protein coding sequence of clone AJ1_1 
deposited under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone AJ1_1 deposited under accession number ATCC 98278. In yet other preferred 
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embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID N02 from amino acid 52 to amino acid 
147. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
5 ID NCfcl or SEQ ID NO.3. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

10 (b) the amino acid sequence of SEQ ID NO:2 from amino acid 52 to 

amino acid 147; 

(c) fragments of the amino acid sequence of SEQ ID NO'2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
AJ1_1 deposited under accession number ATCC 98278; 

15 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:2 or the amino acid sequence 
of SEQ ID NO.2 from amino acid 52 to amino acid 147. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
20 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:4; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:4 from nucleotide 7 to nucleotide 1647; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

2 5 NCh4 from nucleotide 1 to nucleotide 305; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done AQ73_3 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 

3 0 cDNA insert of clone AQ73_3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AQ73J3 deposited under accession number 
ATCC 98278; 
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(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AQ73_3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NOS; 

5 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NQ5 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
10 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-{i)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NCh4 from nucleotide 7 to nucleotide 1647; the nucleotide sequence of SEQ ID NQ.4 from 
15 nucleotide 1 to nucleotide 305; the nucleotide sequence of the full-length protein coding 
sequence of clone AQ73_3 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone AQ73J3 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

2 0 clone AQ73..3 deposited under accession number ATCC 98278. In yet other preferred 

embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID N05 from amino acid 1 to amino acid 68. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
K>NO:4. 

25 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID N05; 

(b) the amino acid sequence of SEQ ID NO:5 from amino acid 1 to 

3 0 amino acid 68; 

(c) fragments of the amino acid sequence of SEQ ID NO:5; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
AQ73_3 deposited under accession number ATCC 98278; 
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the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO'3 or the amino acid sequence 
of SEQ ID NO.5 from amino acid 1 to amino acid 68. 

In one embodiment, the present invention provides a composition comprising an 
5 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NCfc6; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:6 from nucleotide 62 to nucleotide 757; 

10 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:6 from nucleotide 357 to nucleotide 703; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BG142_1 deposited under accession 
number ATCC 98278; 

15 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone BG142_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BG142JL deposited under accession number 
ATCC 98278; 

20 (g) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone BG142_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:7; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
2 5 amino acid sequence of SEQ ID NO:7 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

30 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NQ6 from nucleotide 62 to nucleotide 757; the nucleotide sequence of SEQ ID NO:6 from 
nucleotide 357 to nucleotide 703; the nucleotide sequence of the full-length protein coding 
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sequence of clone BG142JL deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of done BG142_1 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
5 done BG142_1 deposited under accession number ATCC 98278. In yet other preferred 
embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:7 from amino add 184 to amino add 
214. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
10 IDNO:6. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:7; 

15 (b) the amino acid sequence of SEQ ID NO:7 from amino add 184 to 

amino add 214; 

(c) fragments of the amino add sequence of SEQ ID NO:7; and 

(d) the amino add sequence encoded by the cDNA insert of done 
BG142_1 deposited under accession number ATCC 98278; 

2 0 the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino add sequence of SEQ ID NO:7 or the amino add sequence 
of SEQ ID NO.7 from amino add 184 to amino add 214. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynudeotide selected from the group consisting of: 
25 (a) a polynudeotide comprising the nudeotide sequence of SEQ ID 

NO:8; 

(b) a polynudeotide comprising the nudeotide sequence of SEQ ID 
NO.8 from nudeotide 404 to nudeotide 535; 

(c) a polynudeotide comprising the nudeotide sequence of SEQ ID 

3 0 NO:8 from nudeotide 1 to nudeotide 666; 

(d) a polynudeotide comprising the nudeotide sequence of the full- 
length protein coding sequence of done BV66_1 deposited under accession 
number ATCC 98278; 
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(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BV66_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BV66_1 deposited under accession number 

5 ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BV66_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:9; • 

10 (i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO:9 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 

15 of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:8 from nucleotide 404 to nucleotide 535; the nucleotide sequence of SEQ ID N08 

20 from nucleotide 1 to nucleotide 666; the nucleotide sequence of the full-length protein 
coding sequence of clone BV66_1 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone BV66_1 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 

2 5 clone BV66_1 deposited under accession number ATCC 98278. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:9 from amino acid 1 to amino acid 38. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:8. 

30 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:9; 



7 



WO 98/25962 



PCT/US97/23224 



(b) the amino acid sequence of SEQ ID NO:9 from amino acid 1 to 
amino acid 38; 

(c) fragments of the amino acid sequence of SEQ ID NO:9; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
5 BV66_1 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:9 or the amino acid sequence 
of SEQ ID NCh9 from amino acid 1 to amino acid 38. 

In one embodiment, the present invention provides a composition comprising an 
1 0 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NOlO; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NOrlO from nucleotide 1204 to nucleotide 1389; 

15 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NChlO from nucleotide 881 to nucleotide 1380; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BV291_3 deposited under accession 
number ATCC 98278; 

20 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone BV291_3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BV291_3 deposited under accession number 
ATCC 98278; 

25 (g) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone BV291.3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO.ll; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
3 0 amino acid sequence of SEQ ID NO:ll having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

8 
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(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO10 from nucleotide 1204 to nucleotide 1389; the nucleotide sequence of SEQ ID NO:10 
5 from nucleotide 881 to nucleotide 1380; the nucleotide sequence of the full-length protein 
coding sequence of done BV291_3 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone BV291.3 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
10 clone BV291_3 deposited under accession number ATCC 98278. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:ll from amino acid 1 to amino acid 
59. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
15 IDNOrlO. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting ot 

(a) the amino acid sequence of SEQ ID NChll; 
20 (b) the amino acid sequence of SEQ ID NO:ll from amino acid 1 to 

amino acid 59; 

(c) fragments of the amino acid sequence of SEQ ID NO:ll; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BV291_3 deposited under accession number ATCC 98278; 

25 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:ll or the amino acid sequence 
of SEQ ID NO-.ll from amino acid 1 to amino acid 59. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of. 

30 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

Nai2; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 189 to nucleotide 1115; 
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(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO.12 from nucleotide 1 to nucleotide 451; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CK201.1 deposited under accession 

number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone CK201_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CK201.1 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CK201_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino aad 

sequence of SEQ ID NQ.13; 

(i) a polynucleotide encoding a protein comprising a fragment of the 

amino acid sequence of SEQ ID NO.13 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 

(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 

of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (aH»)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO-12 from nucleotide 189 to nucleotide 1115; the nucleotide sequence of SEQ ID NO:12 
from nucleotide 1 to nucleotide 451; the nucleotide sequence of the full-length protein 
codingsequenceofcloneCK201_ldeposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone CK201_1 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 

polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
D clone CK201 1 deposited under accession number ATCC 98278. In yet other preferred 

embodiments, the present invention provides a polynucleotide encoding a protein 

comprising the amino acid sequence of SEQ ID NO:13 from amino acid 1 to ammo aad 

88. 
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Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NChl2. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
5 consisting of: 

(a) the amino acid sequence of SEQ ID NO:13; 

(b) the amino acid sequence of SEQ ID NO:13 from amino acid 1 to 
amino acid 88; 

(c) fragments of the amino acid sequence of SEQ ID NO:13; and 

10 (d) the amino acid sequence encoded by the cDNA insert of clone 

CK201_1 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. Preferably such 

protein comprises the amino acid sequence of SEQ ID NO:13 or the amino acid sequence 

of SEQ ID NO:13 from amino acid 1 to amino acid 88. 
15 In one embodiment the present invention provides a composition comprising an 

isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:14; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
2 0 NO:14 from nucleotide 117 to nucleotide 923; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 174 to nucleotide 923; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
N014 from nucleotide 1 to nucleotide 316; 

25 (e) a polynucleotide comprising the nucleotide sequence of the full- 

length protein coding sequence of clone CQ331_2 deposited under accession 

number ATCC 98278; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CQ331_2 deposited under accession number ATCC 98278; 
30 (g) a polynucleotide comprising the nucleotide sequence of the mature 

protein coding sequence of clone CQ331J2 deposited under accession number 
ATCC 98278; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CQ331J2 deposited under accession number ATCC 98278; 

11 
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(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:15; 

(j) a polynucleotide encoding a protein comprising a fragment of the* 
amino acid sequence of SEQ ID NO:15 having biological activity; 
5 (k) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 

of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 

10 to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO.14 from nucleotide 117 to nucleotide 923; the nucleotide sequence of SEQ ID NO:14 
from nucleotide 174 to nucleotide 923; the nucleotide sequence of SEQ ID NO:14 from 
nucleotide 1 to nucleotide 316; the nucleotide sequence of the full-length protein coding 

15 sequence of done CQ331_2 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone CQ331_2 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
done CQ331_2 deposited under accession number ATCC 98278. In yet other preferred 

20 embodiments, the present invention provides a polynudeotide encoding a protein 
comprising the amino add sequence of SEQ ID NO:15 from amino add 1 to amino add 
57. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
ID NO:14. ' 

25 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino add sequence selected from the group 
consisting of: 

(a) the amino add sequence of SEQ ID NO:15; 

(b) the amino add sequence of SEQ ID NO:15 from amino add 1 to 
3 0 amino add 57; 

(c) fragments of the amino add sequence of SEQ ID NO:15; and 

(d) the amino add sequence encoded by the cDNA insert of done 
CQ331_2 deposited under accession number ATCC 98278; 
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the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO: 15 or the amino acid sequence 
of SEQ ID NO:15 from amino acid 1 to amino acid 57. 

In one embodiment, the present invention provides a composition comprising an 
5 isolated polynucleotide selected from the group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:16; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NCM6 from nucleotide 223 to nucleotide 483; 

10 (c) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NChl6 from nucleotide 22 to nucleotide 397; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT550_1 deposited under accession 
number ATCC 98278; 

15 (e) a polynucleotide encoding the full-length protein encoded by the 

cDNA insert of clone CT550_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CT550_1 deposited under accession number 
ATCC 98278; 

20 (g) a polynucleotide encoding the mature protein encoded by the 

cDNA insert of clone CT550_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO.17; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
2 5 amino acid sequence of SEQ ID NO:17 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 

(aMg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

30 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:16 from nucleotide 223 to nucleotide 483; the nucleotide sequence of SEQ ID NO:16 
from nucleotide 22 to nucleotide 397; the nucleotide sequence of the full-length protein 
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coding sequence of done CT550_1 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone CT550_1 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
5 clone CT550_1 deposited under accession number ATCC 98278. In yet other preferred 
embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:17 from amino acid 1 to amino acid 
58. 

Other embodiments provide the gene corresponding to the cDN A sequence of SEQ 
10 IDNO:16. 

In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of. 

(a) the amino acid sequence of SEQ ID N017; 

15 (b) the amino acid sequence of SEQ ID NO:17 from amino acid 1 to 

amino acid 58; 

(c) fragments of the amino acid sequence of SEQ ID NO:17; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CT550J. deposited under accession number ATCC 98278; 

20 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:17 or the amino acid sequence 
of SEQ ID NO:17 from amino acid 1 to amino acid 58. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of. 
25 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:18; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 112 to nucleotide 969; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
3 0 NO:18 from nucleotide 154 to nucleotide 969; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 1 to nucleotide 423; 
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(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT585_1 deposited under accession 
number ATCC 98278; 

(f) a polynucleotide encoding the full-length protein encoded by the 
5 cDN A insert of clone CT585J1 deposited under accession number ATCC 98278; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CT585_1 deposited under accession number 
ATCC 98278; 

(h) a polynucleotide encoding the mature protein encoded by the 
1 0 cDNA insert of clone CT585_1 deposited under accession number ATCC 98278; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:19; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:19 having biological activity; 
15 (k) a polynucleotide which is an allelic variant of a polynucleotide of 

(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
20 to any one of the polynucleotides specified in (a)-(j). 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 112 to nucleotide 969; the nucleotide sequence of SEQ ID NChl8 
from nucleotide 154 to nucleotide 969; the nucleotide sequence of SEQ ID NO:18 from 
nucleotide 1 to nucleotide 423; the nucleotide sequence of the full-length protein coding 
25 sequence of clone CT585J. deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone CT585_1 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CT585_1 deposited under accession number ATCC 98278. In yet other preferred 
30 embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID NO:19 from amino acid 1 to amino acid 
104. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:18. 
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In other embodiments, the present invention provides a composition comprising 
a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NO:19; 

5 (b) the amino acid sequence of SEQ ID NO:19 from amino acid 1 to 

amino acid 104; 

(c) fragments of the amino acid sequence of SEQ ID NO:19; and 

(d) the amino acid sequence encoded by the cDNA insert of done 
CT585_1 deposited under accession number ATCC 98278; 

10 the protein being substantially free from other mammalian proteins. Preferably such 
protein comprises the amino acid sequence of SEQ ID NO:19 or the amino acid sequence 
of SEQ ID N019 from amino acid 1 to amino acid 104. 

In one embodiment, the present invention provides a composition comprising an 
isolated polynucleotide selected from the group consisting of: 
15 (a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:20; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:20 from nucleotide 37 to nucleotide 2766; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
2 0 NO20 from nucleotide 243 to nucleotide 789; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT797J3 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 

2 5 cDNA insert of clone CT797_3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CT797_3 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 

3 0 cDN A insert of clone CT797_3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 

sequence of SEQ ID NO:21; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:21 having biological activity; 
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(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

5 (1) a polynucleotide capable of hybridizing under stringent conditions 

to any one of the polynucleotides specified in (a)-(i)- 

Preferably, such polynucleotide comprises the nucleotide sequence of SEQ ID 
NCh20 from nucleotide 37 to nucleotide 2766; the nucleotide sequence of SEQ ID NO20 
from nucleotide 243 to nucleotide 789; the nucleotide sequence of the full-length protein 

1 0 coding sequence of done CI797_3 deposited under accession number ATCC 98278; or the 
nucleotide sequence of the mature protein coding sequence of clone CT797_3 deposited 
under accession number ATCC 98278. In other preferred embodiments, the 
polynucleotide encodes the full-length or mature protein encoded by the cDNA insert of 
clone CT797_3 deposited under accession number ATCC 98278. In yet other preferred 

15 embodiments, the present invention provides a polynucleotide encoding a protein 
comprising the amino acid sequence of SEQ ID N021 from amino acid 75 to amino acid 
251. 

Other embodiments provide the gene corresponding to the cDNA sequence of SEQ 
ID NO:20. 

20 In other embodiments, the present invention provides a composition comprising 

a protein, wherein said protein comprises an amino acid sequence selected from the group 
consisting of: 

(a) the amino acid sequence of SEQ ID NCh21; 

(b) the amino acid sequence of SEQ ID NCh21 from amino acid 75 to 

2 5 amino acid 251; 

(c) fragments of the amino acid sequence of SEQ ID NO:21; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CT797J3 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. Preferably such 

3 0 protein comprises the amino acid sequence of SEQ ID NO:21 or the amino acid sequence 

of SEQ ID NO:21 from amino acid 75 to amino acid 251. 

In certain preferred embodiments, the polynucleotide is operably linked to an 
expression control sequence. The invention also provides a host cell, including bacterial, 
yeast, insect and mammalian cells, transformed with such polynucleotide compositions. 
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Processes are also provided for producing a protein, which comprise: 

(a) growing a culture of the host cell transformed with such 
polynucleotide compositions in a suitable culture medium; and 

(b) purifying the protein from the culture. 

5 The protein produced according to such methods is also provided by the present 
invention- Preferred embodiments include those in which the protein produced by such 
process is a mature form of the protein. 

Protein compositions of the present invention may further comprise a 
pharmaceutical^ acceptable carrier. Compositions comprising an antibody which 
1 0 specifically reacts with such protein are also provided by the present invention. 

Methods are also provided for preventing, treating or ameliorating a medical 
condition which comprises administering to a mammalian subject a therapeutically 
effective amount of a composition comprising a protein of the present invention and a 
pharmaceutically acceptable carrier. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figures 1 A and IB are schematic representations of the pED6 and pNOTs vectors, 
respectively, used for deposit of clones disclosed herein. 

20 DETAILED DESCRIPTION 

ISOLATED PROTEINS AND POLYNUCLEOTIDES 

Nucleotide and amino acid sequences, as presently determined, are reported 
below for each clone and protein disclosed in the present application. The nucleotide 
sequence of each clone can readily be determined by sequencing of the deposited clone 

25 in accordance with known methods. The predicted amino acid sequence (both full-length 
and mature) can then be determined from such nucleotide sequence. The amino acid 
sequence of the protein encoded by a particular clone can also be determined by 
expression of the clone in a suitable host cell, collecting the protein and determining its 
sequence. For each disclosed protein applicants have identified what they have 

3 0 determined to be the reading frame best identifiable with sequence information available 
at the time of filing. 

As used herein a "secreted" protein is one which, when expressed in a suitable host 
cell, is transported across or through a membrane, including transport as a result of signal 
sequences in its amino acid sequence. "Secreted" proteins include without limitation 
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proteins secreted wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell 
in which they are expressed. "Secreted" proteins also include without limitation proteins 
which are transported across the membrane of the endoplasmic reticulum. 

5 Clone "ATI 1" 

A polynucleotide of the present invention has been identified as clone "AJl_l n . 
AJ1_1 was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see ILS. Pat No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
10 analysis of the amino acid sequence of the encoded protein. AJ1_1 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 
"AJ1J protein"). 

The nucleotide sequence of the 5' portion of AJ1_1 as presently determined is 
reported in SEQ ID NOl. What applicants presently believe is the proper reading frame 
1 5 for the coding region is indicated in SEQ ID NChZ The predicted amino acid sequence of 
the AJ1_1 protein corresponding to the foregoing nucleotide sequence is reported in SEQ 
ID NO:2. Additional nucleotide sequence from the 3' portion of AJ1_1, including the 
polyA tail, is reported in SEQ ID NO:3. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
2 0 AJ1_1 should be approximately 925 bp. 

The predicted amino acid sequence disclosed herein for AJ1_1 was searched 
against the GenPept and GeneSeq amino acid sequence databases using the BLASTX 
search protocol. The predicted AJ1_1 protein demonstrated at least some similarity to 
sequences identified as U39060 (GRTP1 [Mus musculus]). Based upon sequence similarity, 

2 5 AJ1_1 proteins and each similar protein or peptide may share at least some activity. 

Clone " AQ73 3" 

A polynucleotide of the present invention has been identified as clone "AQ73_3". 
AQ73_3 was isolated from a human adult ovary (PA-1 teratocarcinoma, untreated tissue 

3 0 pooled with retinoic-acid-treated and activin-treated tissue) cDNA library using methods 

which are selective for cDNAs encoding secreted proteins (see US. Pat No. 5,536,637), or 
was identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. AQ73_3 is a full-length done, 
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including the entire coding sequence of a secreted protein (also referred to herein as 
"AQ73_3 protein"). 

The nucleotide sequence of AQ73J3 as presently determined is reported in SEQ 
ID NO:4. What applicants presently believe to be the proper reading frame and the 
5 predicted amino acid sequence of the AQ73JJ protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NQ5. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
AQ73_3 should be approximately 2800 bp. 

The nucleotide sequence disclosed herein for AQ73_3 was searched against the 

10 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. AQ73_3 demonstrated at least some similarity with sequences 
identified as AA514474 (nf57gOLsl NQ_CGAP_Co3 Homo sapiens cDNA clone 924048), 
T47520 (Human hepatoma-derived growth factor (HDGF-2) cDNA), W24708 (zb62e08.rl 
Soares fetal lung NbHL19W Homo sapiens cDNA clone 308198 5'), and W45513 

15 (zc27g08.sl Soares senescent fibroblasts NbHSF Homo sapiens cDNA clone 323582 3'). 
The predicted amino acid sequence disclosed herein for AQ73J3 was searched against the 
GenPept and GeneSeq amino acid sequence databases using the BLASTX search protocol. 
The predicted AQ73_3 protein demonstrated at least some similarity to sequences 
identified as D16431 (hepatoma-derived GF [Homo sapiens]), D63707 (mouse hepatoma 

20 derived growth factor (HDGF) [Mus musculus]), R66727 (Human hepatoma derived 
growth factor), U18997 (ORFJ299 [Escherichia coli]), U97193 (similar to S. cerevisiae SIR2 
(SP P06700) and mouse hepatoma derived growth factor HDGF (NID g945418) 
[Caenorhabditis elegans]), and W09404 (Human hepatoma-derived growth factor 
(HDGF-2)). Based upon sequence similarity, AQ73_3 proteins and each similar protein 

25 or peptide may share at least some activity. 

Clone "BG142 1" 

A polynucleotide of the present invention has been identified as clone M BG142_1". 
BG142_1 was isolated from a human adult brain cDNA library using methods which are 
30 selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. BG142_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as W BG142_1 protein"). , 
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The nucleotide sequence of BG142_1 as presently determined is reported in SEQ 
ID NO:6. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the BG142_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NQ7. 
5 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

BG142_1 should be approximately 1100 bp. 

The nucleotide sequence disclosed herein for BG142_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. BG142_1 demonstrated at least some similarity with sequences 

1 0 identified as AA170261 (ms87hll.rl Soares mouse 3NbMS Mus musculus cDNA clone 
618597 5' similar to TR E245601 E245601 G-RICH BOX-BINDING PROTEIN), L04282 
(Human CACCC box-binding protein mRNA, complete cds), N27696 (yx51hl2.rl Homo 
sapiens cDNA done 265319 5'), W96110 (ze09allj*l Soares fetal heart NbHH19W Homo 
sapiens cDNA clone 358460 5'), and W96111 (ze09all.sl Soares fetal heart NbHH19W 

15 Homo sapiens cDNA clone 358460 3'). The predicted amino acid sequence disclosed 
herein for BG142_1 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted BG142_1 protein 
demonstrated at least some similarity to sequences identified as U80078 (transcription 
factor BFCOL1 [Mus musculus]) and X98096 (G-rich box-binding protein [Mus 

2 0 musculus]). Based upon sequence similarity, BG142_1 proteins and each similar protein 
or peptide may share at least some activity. 

Clone "BV66 1" 

A polynucleotide of the present invention has been identified as clone W BV66_1". 

2 5 BV66_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. BV66_1 is a full-length clone, 
including the entire coding sequence of a secreted protein (also referred to herein as 

30 "BV66_1 protein"). 

The nucleotide sequence of BV66_1 as presently determined is reported in SEQ ID 
NO:8. What applicants presently believe to be the proper reading frame and the predicted 
amino acid sequence of the BV66_1 protein corresponding to the foregoing nucleotide 
sequence is reported in SEQ ID NO#. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
BV66J1 should be approximately 870 bp. 

The nucleotide sequence disclosed herein for BV66_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
5 FASTA search protocols. No hits were found in the database. The nucleotide sequence 
of BV66_1 indicates that it may contain a TAAA1 simple repeat element. 

Clone "BV291 3" 

A polynucleotide of the present invention has been identified as clone M BV291_3". 

1 0 BV291_3 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. BV291_3 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 

15 as M BV291_3 protein"). 

The nucleotide sequence of BV291_3 as presently determined is reported in SEQ 
ID NO.10. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the BV291J3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:ll. 

2 0 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

BV291_3 should be approximately 2000 bp. 

The nucleotide sequence disclosed herein for BV291_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. BV291_3 demonstrated at least some similarity with sequences 
25 identified as H10954 (ym06e09.rl Homo sapiens cDNA clone 47034 5'), H10955 
(ym06e09.sl Homo sapiens cDNA clone 47034 3'), N25300 (yw52cl0.sl Homo sapiens 
cDNA clone 255858 3'), T25940 (Human gene signature HUMGS08173), T68890 
(yc30gll.sl Homo sapiens cDNA clone 82244 3'), T78286 (yc99a08.rl Homo sapiens cDNA 
clone 24033 5'), Z39987 (H. sapiens partial cDNA sequence; clone c-loh05), and Z47073 

3 0 (Caenorhabditis elegans cosmid ZC506). The predicted amino acid sequence disclosed 

herein for BV291_3 was searched against the GenPept and GeneSeq amino acid sequence 
databases using the BLASTX search protocol. The predicted BV291_3 protein 
demonstrated at least some similarity to sequences identified as X02155 (BTTGR_1 
thyroglobulin [Bos taurus]). Based upon sequence similarity, BV291_3 proteins and each 

22 



WO 98/25962 PCT/DS97/23224 

similar protein or peptide may share at least some activity. The TopPredll computer 
program predicts a potential transmembrane domain within the BV291_3 protein 
sequence centered around amino acid 48 of SEQ ID NO.ll. 

5 Clone "CK201 1" 

A polynucleotide of the present invention has been identified as clone "CK201_1". 
CK201_1 was isolated from a human adult testes cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
10 analysis of the amino acid sequence of the encoded protein. CK201_1 is a full-length 
done, including the entire coding sequence of a secreted protein (also referred to herein 
as "CK201JL protein"). 

The nucleotide sequence of CK201_1 as presently determined is reported in SEQ 
ID NO:12. What applicants presently believe to be the proper reading frame and the 
15 predicted amino acid sequence of the CK201_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:13. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CK201_1 should be approximately 1080 bp. 

The nucleotide sequence disclosed herein for CK201_1 was searched against the 
2 0 GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CK201_1 demonstrated at least some similarity with sequences 
identified as AA129133 (zo09hlZsl Stratagene neuroepithelium NT2RAMI 937234 Homo 
sapiens cDNA done 567239 3' similar to contains Alu repetitive element), D81444 (Human 
fetal brain cDNA 5'-end GEN-164G10), R36326 (yg69h09.rl Homo sapiens cDNA done 

2 5 38821 5'), T08553 (Oncogene R-ras mutant cDNA (exons 2-6)), T31595 (Probe (BLUR13) 

for Alu repeat sequence), X03273 (Human Alu-family duster 5' of alpha(l)-add 
glycoprotein gene), and X69907 (H.sapiens gene for mitochondrial ATP synthase c 
subunit). The predicted amino add sequence disdosed herein for CK201_1 was searched 
against the GenPept and GeneSeq amino add sequence databases using the BLASTX 

3 0 search protocol. The predicted CK201_1 protein demonstrated at least some similarity to 

sequences identified as D21827 (major surface glycoprotein [Pneumocystis carinii]). Based 
upon sequence similarity, CK201_1 proteins and each similar protein or peptide may 
share at least some activity. The nudeonde sequence of CK201_1 indicates that it may 
contain an Alu repetitive dement. 
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Clone "CQ331 2" 

A polynucleotide of the present invention has been identified as clone "CQ331_2 W . 
CQ331J2 was isolated from a human adult heart cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5,536,637), or was 
5 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CQ331_2 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "CQ331_2 protein"). 

The nucleotide sequence of CQ331J2 as presently determined is reported in SEQ 

10 ID NO:14. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CQ331_2 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:15. Amino acids 7 to 19 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 20, or are a transmembrane domain. 

1 5 The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 

CQ331_2 should be approximately 1600 bp. 

The nucleotide sequence disclosed herein for CQ331_2 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CQ331_2 demonstrated at least some similarity with sequences 

2 0 identified as J03766 (Canine cardiac calsequestrin mRNA, complete cds), L29766 (Homo 
sapiens epoxide hydrolase (EPHX) gene, complete cds), N83601 (KK1173F Homo sapiens 
cDNA clone KK1173 5' similar to CALSEQUESTRIN (CARDIAC)), T99646 (ye73fl2.sl 
Homo sapiens cDN A clone 123407 3' similar to contains Alu repetitive element;contains 
FTR5 repetitive element), and W76326 (zd60d04.rl Soares fetal heart NbHH19W Homo 

2 5 sapiens cDNA clone 345031 5' similar to contains Alu repetitive element). The predicted 
amino acid sequence disclosed herein for CQ331_2 was searched against the GenPept and 
GeneSeq amino acid sequence databases using the BLASTX search protocol. The 
predicted CQ331_2 protein demonstrated at least some similarity to sequences identified 
as J03766 (DOGCAL_l Canine cardiac calsequestrin mRNA, complete cds [Canis canis]) 

30 and X55040 (calsequestrin [Oryctolagus cuniculus]). Based upon sequence similarity, 
CQ331.2 proteins and each similar protein or peptide may share at least some activity. 
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Clone "CT550 1" 

A polynucleotide of the present invention has been identified as clone M CT550_1". 
CT550_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat. No. 5,536,637), or was 
5 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CT550_1 is a full-length 
done, including the entire coding sequence of a secreted protein (also referred to herein 
as "CT550_1 protein"). 

The nucleotide sequence of CT550_1 as presently determined is reported in SEQ 
10 ID NO:16. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CT550_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:17. 

The EeoRI /NotI restriction fragment obtainable from the deposit containing clone 
CT550_1 should be approximately 1070 bp. 
15 The nucleotide sequence disclosed herein for CT550_1 was searched against the 

GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. No hits were found in the database. The TopPredll computer 
program predicts a potential transmembrane domain within the CT550_1 protein 
sequence centered around amino acid 25 of SEQ ID NO:17. 

20 

Clone "CT585 1" 

A polynucleotide of the present invention has been identified as clone "CT585_l n . 
CT585_1 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5,536,637), or was 

25 identified as encoding a secreted or transmembrane protein on the basis of computer 
analysis of the amino acid sequence of the encoded protein. CT585_1 is a full-length 
clone, including the entire coding sequence of a secreted protein (also referred to herein 
as "CT585J protein"). 

The nucleotide sequence of CT585__1 as presently determined is reported in SEQ 

30 ID NO:18. What applicants presently believe to be the proper reading frame and the 
predicted amino acid sequence of the CT585_1 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:19. Amino acids 2 to 14 are a predicted 
leader/signal sequence, with the predicted mature amino acid sequence beginning at 
amino acid 15, or are a transmembrane domain. 
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The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CT585_1 should be approximately 2710 bp. 

The nucleotide sequence disclosed herein for CT585_1 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
5 FASTA search protocols. CT585_1 demonstrated at least some similarity with sequences 
identified as AA069442 (zf74b02.sl Soares pineal gland N3HPG Homo sapiens cDNA 
clone 382635 3'), L38961 (Homo sapiens putative transmembrane protein (B5) mRNA, 
complete cds), N34932 (yy49bl0.sl Homo sapiens cDNA clone 276859 3'), N60101 
(TgESTzyllfl0.rl Toxoplasma gondii cDNA clone tgzyllfl0.rl 5'), and U13019 

1 0 (Caenorhabditis elegans cosmid T12A2). The predicted amino acid sequence disclosed 
herein for CT585_1 was searched against the GenPept, GeneSeq, and SwissProt amino 
acid sequence databases using the BLASTX search protocol. The predicted CT585JL 
protein demonstrated at least some similarity to sequences identified as L34260 
(transmembrane protein [Mus musculus]), L38961 (transmembrane protein [Homo 

1 5 sapiens]), P46975 (Caenorhabditis elegans oligosaccharyl transferase stt3 [Caenorhabditis 
elegans]), and U13019 (Caenorhabditis elegans cosmid T12A2 [Caenorhabditis elegans]). 
Based upon sequence similarity, CT585_1 proteins and each similar protein or peptide 
may share at least some activity. 

20 Clone "CT797 3" 

A polynucleotide of the present invention has been identified as clone "CT797_3". 
CT797_3 was isolated from a human adult brain cDNA library using methods which are 
selective for cDNAs encoding secreted proteins (see U.S. Pat No. 5,536,637), or was 
identified as encoding a secreted or transmembrane protein on the basis of computer 
25 analysis of the amino acid sequence of the encoded protein. CT797_3 is a full-length 
done, including the entire coding sequence of a secreted protein (also referred to herein 
as "CT797_3 protein"). 

The nucleotide sequence of CT797J3 as presently determined is reported in SEQ 
ID NO:20. What applicants presently believe to be the proper reading frame and the 
3 0 predicted amino acid sequence of the CT797J3 protein corresponding to the foregoing 
nucleotide sequence is reported in SEQ ID NO:21. 

The EcoRI/NotI restriction fragment obtainable from the deposit containing clone 
CT797_3 should be approximately 3300 bp. 
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The nucleotide sequence disclosed herein for CT797_3 was searched against the 
GenBank and GeneSeq nucleotide sequence databases using BLASTN/BLASTX and 
FASTA search protocols. CT797_3 demonstrated at least some similarity with sequences 
identified as AA573847 (nk08d06.sl NCI_CGAP_Co2 Homo sapiens cDNA clone 
5 IMAGE: 101 2907). The predicted amino acid sequence disclosed herein for CT797_3 was 
searched against the GenPept and GeneSeq amino acid sequence databases using the 
BLASTX search protocol. The predicted CI797_3 protein demonstrated at least some 
similarity to sequences identified as U18309 (chromokinesin [Gallus gallus]) and Z82271 
(T01G1.1 [Caenorhabditis elegans]). Based upon sequence similarity, CT797_3 proteins 
1 0 and each similar protein or peptide may share at least some activity. 

Deposit of Clones 

Clones AJ1_1, AQ73_3, BG142J, BV66_1, BV291_3, CK201J, CQ331_2, CT550 _1, 
CT585_1 and CT797_3 were deposited on December 13, 1996 with the American Type 

1 5 Culture Collection as an original deposit under the Budapest Treaty and were given the 
accession number ATCC 98278, from which each clone comprising a particular 
polynucleotide is obtainable. AD restrictions on the availability to the public of the 
deposited material will be irrevocably removed upon the granting of the patent, except 
for the requirements specified in 37 C.F.R. § 1.808(b). 

20 Each clone has been transfected into separate bacterial cells (E. colt) in this 

composite deposit Each done can be removed from the vector in which it was deposited 
by performing an EcoRI/NotI digestion (5' site, EcoRI; 3* site, NotI) to produce the 
appropriate fragment for such clone. Each clone was deposited in either the pED6 or 
pNOTs vector depicted in Fig. 1. The pED6dpc2 vector ("pED6") was derived from 

2 5 pED6dpcl by insertion of a new poiylinker to facilitate cDNA cloning (Kaufman et al, 

1991, Nucleic Acids Res, 19: 448S4490); the pNOTs vector was derived from pMT2 
(Kaufman et al, 1989, Mol Cell Biol 9: 946-958) by deletion of the DHFR sequences, 
insertion of a new poiylinker, and insertion of the M13 origin of replication in the Clal site. 
In some instances, the deposited clone can become "flipped" (i.e., in the reverse 

3 0 orientation) in the deposited isolate. In such instances, the cDNA insert can still be 

isolated by digestion with EcoRI and NotI. However, NotI will then produce the 5' site 
and EcoRI will produce the 3' site for placement of the cDNA in proper orientation for 
expression in a suitable vector. The cDNA may also be expressed from the vectors in 
which they were deposited. 
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Bacterial cells containing a particular done can be obtained from the composite, 
deposit as follows: 

An oligonucleotide probe or probes should be designed to the sequence that is 
known for that particular clone. This sequence can be derived from the sequences 
5 provided herein, or from a combination of those sequences. The sequence of the 
oligonucleotide probe that was used to isolate each full-length clone is identified below, 
and should be most reliable in isolating the clone of interest. 

gone Probe Sequence 

10 AJ1_1 SEQ ID NO:22 

AQ73.3 SEQ ID NO:23 

BG142_1 SEQ ID NO:24 

BV66_1 SEQ ID NCh25 

BV291_3 SEQ ID NO:26 

1 5 CK201_1 SEQ ID NO:27 

CQ331_2 SEQ ID NO:28 

CT550JL SEQ ID NO.29 

CT585_1 SEQ ID NO:30, SEQ ID NO:32 



20 



CT797_3 SEQ ID NO:31 



In the sequences listed above which include an N at position 2, that position is occupied 
in preferred probes/primers by a biotinylated phosphoaramidite residue rather than a 
nucleotide (such as , for example, that produced by use of biotin phosphoramidite (1- 
dimethoxytrityloxy-2-(N-biotinyl-4-amin^ 
2 5 diisopropyl)-phosphoramadite) (Glen Research, cat no. 10-1953)). 

The design of the oligonucleotide probe should preferably follow these 
parameters: 

(a) It should be designed to an area of the sequence which has the fewest 
ambiguous bases ("NTs"), if any; 
30 (b) It should be designed to have a T m of approx. 80 0 C (assuming 2° for each 

A or T and 4 degrees for each G or C). 
The oligonucleotide should preferably be labeled with g-^P ATP (specific activity 6000 
Ci/mmole) and T4 polynucleotide kinase using commonly employed techniques for 
labeling oligonucleotides. Other labeling techniques can also be used. Unincorporated 
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label should preferably be removed by gel filtration chromatography or other established 
methods. The amount of radioactivity incorporated into the probe should be quantitated 
by measurement in a scintillation counter. Preferably, specific activity of the resulting 
probe should be approximately 4e+6 dpm/pmole. 
5 The bacterial culture containing the pool of full-length clones should preferably 

be thawed and 100 pi of the stock used to inoculate a sterile culture flask containing 25 ml 
of sterile L-broth containing ampicillin at 100 pg/ml. The culture should preferably be 
grown to saturation at 37°C, and the saturated culture should preferably be diluted in 
fresh L-broth. Aliquots of these dilutions should preferably be plated to determine the 

10 dilution and volume which will yield approximately 5000 distinct and well-separated 
colonies on solid bacteriological media containing L-broth containing ampicillin at 100 
pg/ml and agar at 15% in a 150 mm petri dish when grown overnight at 37°C. Other 
known methods of obtaining distinct, well-separated colonies can also be employed. 

Standard colony hybridization procedures should then be used to transfer the 

15 colonies to nitrocellulose filters and lyse, denature and bake them. 

The filter is then preferably incubated at 65°C for 1 hour with gentle agitation in 
6X SSC (20X stock is 175.3 g Nad/liter, 88.2 g Na citrate/liter, adjusted to pH 7.0 with 
NaOH) containing 05% SDS, 100 pg/ml of yeast RNA, and 10 mM EDTA (approximately 
10 mL per 150 mm filter). Preferably, the probe is then added to the hybridization mix at 

20 a concentration greater than or equal to le+6 dpm/mL. The filter is then preferably 
incubated at 65°C with gentle agitation overnight. The filter is then preferably washed in 
500 mL of 2X SSC/05% SDS at room temperature without agitation, preferably followed 
by 500 mL of 2X SSC/0.1% SDS at room temperature with gentle shaking for 15 minutes. 
A third wash with 0.1X SSC/0.5% SDS at 65°C for 30 minutes to 1 hour is optional. The 

25 filter is then preferably dried and subjected to autoradiography for sufficient time to 
visualize the positives on the X-ray film. Other known hybridization methods can also 
be employed. 

The positive colonies are picked, grown in culture, and plasmid DNA isolated 
using standard procedures. The clones can then be verified by restriction analysis, 
3 0 hybridization analysis, or DNA sequencing. 

Fragments of the proteins of the present invention which are capable of exhibiting 
biological activity are also encompassed by the present invention. Fragments of the 
protein may be in linear form or they may be cyclized using known methods, for example, 
as described in H.U. Saragovi, el al, Bio/Technology 10, 773-778 (1992) and in R.S. 
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McDowell, et al., J. Amer. Chem. Soc 114, 9245-9253 (1992), bo* of which are incorporated 
herein by reference. Such fragments may be fused to carrier molecules such as 
immunoglobulins for many purposes, including increasing the valency of protein binding 
sites. For example, fragments of the protein may be fused through "linker" sequences to 
5 the Fc portion of an immunoglobulin. For a bivalent form of the protein, such a fusion 
could be to the Fc portion of an IgG molecule. Other immunoglobulin isotypes may also 
be used to generate such fusions. For example, a protein - IgM fusion would generate a 
decavalent form of the protein of the invention. 

The present invention also provides both full-length and mature forms of the 

1 0 disclosed proteins. The full-length form of the such proteins is identified in the sequence 
listing by translation of die nucleotide sequence of each disclosed clone. The mature form 
of such protein may be obtained by expression of the disclosed full-length polynucleotide 
(preferably those deposited with ATCC) in a suitable mammalian cell or other host cell. 
The sequence of the mature form of. the protein may also be determinable from the amino 

1 5 acid sequence of the full-length form. 

The present invention also provides genes corresponding to the cDNA sequences 
disclosed herein. "Corresponding genes" are the regions of the genome that are 
transcribed to produce the mRN As from which the cDN A sequences are derived and may 
include contiguous regions of the genome necessary for the regulated expression of such 

20 genes. Corresponding genes may therefore include but are not limited to coding 
sequences, 5' and 3' untranslated regions, alternatively spliced exons, introns, promoters, 
enhancers, and silencer or suppressor elements. The corresponding genes can be isolated 
in accordance with known methods using the sequence information disclosed herein. 
Such methods include the preparation of probes or primers from die disclosed sequence 

25 information for identification and/or amplification of genes in appropriate genomic 
libraries or other sources of genomic materials. An "isolated gene" is a gene that has been 
separated from the adjacent coding sequences, if any, present in the genome of the 
organism from which the gene was isolated. 

Where the protein of the present invention is membrane-bound (e.g., is a receptor), 

3 0 the present invention also provides for soluble forms of such protein. In such forms part 
or all of the intracellular and transmembrane domains of the protein are deleted such that 
the protein is fully secreted from the cell in which it is expressed. The intracellular and 
transmembrane domains of proteins of the invention can be identified in accordance with 
known techniques for determination of such domains from sequence information. 
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Proteins and protein fragments of the present invention include proteins with 
amino acid sequence lengths that are at least 25%(more preferably at least 50%, and most 
preferably at least 75%) of the length of a disclosed protein and have at least 60% sequence 
identity (more preferably, at least 75% identity; most preferably at least 90% or 95% 
5 identity) with that disclosed protein, where sequence identity is determined by comparing 
the amino acid sequences of the proteins when aligned so as to maximize overlap and 
identity while minimizing sequence gaps. Also included in the present invention are 
proteins and protein fragments that contain a segment preferably comprising 8 or more 
(more preferably 20 or more, most preferably 30 or more) contiguous amino acids that 
10 shares at least 75% sequence identity (more preferably, at least 85% identity; most 
preferably at least 95% identity) with any such segment of any of the disclosed proteins. 

Species homologs of the disclosed polynucleotides and proteins are also provided 
by the present invention. As used herein, a "species homologue" is a protein or 
polynucleotide with a different species of origin from that of a given protein or 
15 polynucleotide, but with significant sequence similarity to the given protein or 
polynucleotide, as determined by those of skill in the art. Species homologs may be 
isolated and identified by making suitable probes or primers from the sequences provided 
herein and screening a suitable nucleic acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides 
20 or proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide 
which also encode proteins which are identical, homologous, or related to that encoded 
by the polynucleotides . 

The invention also includes polynucleotides with sequences complementary to 
those of the polynucleotides disclosed herein. 

2 5 The present invention also includes polynucleotides capable of hybridizing 

under reduced stringency conditions, more preferably stringent conditions, and most 
preferably highly stringent conditions, to polynucleotides described herein. Examples of 
stringency conditions are shown in the table below: highly stringent conditions are those 
that are at least as stringent as, for example, conditions A-F; stringent conditions are at 

3 0 least as stringent as, for example, conditions G-L; and reduced stringency conditions are 

at least as stringent as, for example, conditions M-R. 
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Stringency 
Condition 


Polynucleotide 
Hybrid 


Hybrid 
Length 
(bp)* 


Hybridization Temperature and 
Buffer* 


Wash 

Temperature 
and Buffer* 


A 


DNA:DNA 


*50 


65°C; IxSSC -or- 

42°C; IxSSC, 50% formamide 


65°C; 03xSSC 


B 


DNAiDNA 


<50 


V; IxSSC 


V; IxSSC 


C 


DNAJUMA 


*50 


67°C; IxSSC -or- 

45°C; IxSSC, 50% formamide 


67°C; 03xSSC 


D 


DNAJUMA 


<50 


V; IxSSC 


T D *; IxSSC 


E 


RNA:RNA 


*50 


70°C; IxSSC -or- 

50°C; IxSSC, 50% formamide 


70°C; 03xSSC 


F 


RNAzRNA 


<50 


V; IxSSC 


T F *; IxSSC 


G 


DNArDNA 


i50 


65°C; 4xSSC -or- 

42°C; 4xSSC, 50% formamide 


65°C; IxSSC 


H 


DNAiDNA 


<50 


T H *; 4xSSC 


T H *; 4xSSC 


I 


DNAiRNA 


i 50 


67°C; 4xSSC -ot- 

45°C; 4xSSC, 50% formamide 


67°C; IxSSC 


J 


DNA*NA 


<50 


T,»; 4xSSC 


T J *;4xSSC 


K 


RNAiRNA 


z 50 


70°C; 4xSSC -or- 

50°C; 4xSSC, 50% formamide 


67°C; IxSSC 


L 


RNA:RNA 


<50 


T L *;2xSSC 


T L *;2xSSC 


M 


DNA:DNA 


* 50 


50°C; 4xSSC -or- 

40°C; 6xSSC, 50% formamide 


50°C;2xSSC 


N 


DNArDNA 


<50 


T N »; 6xSSC 


T N *; 6xSSC 


O 


DNAJtNA 


*50 


55°C; 4xSSC -or- 

42°C; 6xSSC, 50% formamide 


55°C;2xSSC 


P 


DNAJuMA 


<50 


Tp*i 6xSSC 


T P *; 6xSSC 


Q 


RNArRNA 


*50 


60°C; 4xSSC -or- 

45°C; 6xSSC, 50% formamide 


60°C; ?xSSC 


R 


RNA:RNA 


<50 


T B *;4xSSC 


V;4xSSC 



h The hybrid length is that anticipated for the hybridized region(s) of the hybridizing polynucleotides. When 
hybridizing a polynucleotide to a target polynucleotide of unknown sequence, the hybrid length is assumed 
to be that of the hybridizing polynucleotide. When polynucleotides of known sequence are hybridized, the 

2 5 hybrid length can be determined by aligning the sequences of the polynucleotides and identifying the region 

or regions of optimal sequence complementarity. 

SSPE (lxSSPE is 0.15M Nad, lOmM NaH 2 PCV and 1.25mM EDTA, pH 7.4) can be substituted for SSC 
(IxSSC is 0.15M NaCl and 15mM sodium citrate) in the hybridization and wash buffers; washes are 
performed for 15 minutes after hybridization is complete. 
30 *T B - T R : The hybridization temperature for hybrids anticipated to be less than 50 base pairs in length should 
be 5-10°C less than the melting temperature (T^ of the hybrid, where T m is determined according to the 
following equations. For hybrids less than 18 base pairs in length, T m (°C) = 2(# of A + T bases) + 4(# of G + 
C bases). For hybrids between 18 and 49 base pairs in length, T m (°Q = 815 + 16.6(log 10 [Na + ]) + 0.41 (%G+C) - 
(600/N), where N is the number of bases in the hybrid, and [Na*] is the concentration of sodium ions in the 

3 5 hybridization buffer ([Na*] for IxSSC = 0.165 M). 
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Additional examples of stringency conditions for polynucleotide hybridization are 
provided in Sambrook, J., EJ\ Fritsch, and T. Mania tis, 1989, Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
chapters 9 and 11, and Current Protocols in Molecular Biology, 1995, F.M. Ausubel et aL, eds., 
5 John Wiley & Sons, Inc., sections 2.10 and 6.3-6.4, incorporated herein by reference. 

Preferably, each such hybridizing polynucleotide has a length that is at least 
25%(more preferably at least 50%, and most preferably at least 75%) of the length of the 
polynucleotide of the present invention to which it hybridizes, and has at least 60% 
sequence identity (more preferably, at least 75% identity; most preferably at least 90% or 
10 95% identity) with the polynucleotide of the present invention to which it hybridizes, 
where sequence identity is determined by comparing the sequences of the hybridizing 
polynucleotides when aligned so as to maximize overlap and identity while minimizing 
sequence gaps. 

The isolated polynucleotide of the invention may be operably linked to an 
15 expression control sequence such as the pMT2 or pED expression vectors disclosed in 
Kaufman et al., Nucleic Acids Res. 19, 4485-4490 (1991), in order to produce the protein 
recombinantly. Many suitable expression control sequences are known in the art. General 
methods of expressing recombinant proteins are also known and are exemplified in R. 
Kaufman, Methods in Enzymology 185, 537-566 (1990). As defined herein "operably 

2 0 linked" means that the isolated polynucleotide of the invention and an expression control 

sequence are situated within a vector or cell in such a way that the protein is expressed 
by a host cell which has been transformed (transfected) with the ligated 
polynucleotide/expression control sequence. 

A number of types of cells may act as suitable host cells for expression of the 
25 protein. Mammalian host cells include, for example, monkey COS cells, Chinese Hamster 
Ovary (CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 
cells, 3T3 cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell 
strains derived from in vitro culture of primary tissue, primary explants, HeLa cells, 
mouse L cells, BHK, HL-60, U937, HaK or Jurkat cells. 

3 0 Alternatively, it may be possible to produce the protein in lower eukaryotes such 

as yeast or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or any 
yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 
strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 
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strain capable of expressing heterologous proteins. If the protein is made in yeast or 
bacteria, it may be necessary to modify the protein produced therein, for example by 
phosphorylation or glycosylation of the appropriate sites, in order to obtain the functional 
protein. Such covalent attachments may be accomplished using known chemical or 
5 enzymatic methods. 

The protein may also be produced by operably linking the isolated polynucleotide 
of the invention to suitable control sequences in one or more insect expression vectors, 
and employing an insect expression system. Materials and methods for 
baculo virus /insect cell expression systems are commercially available in kit form from, 
10 e.g., Invitrogen, San Diego, California, US. A. (the MaxBac® kit), and such methods are 
well known in the art, as described in Summers and Smith, Texas Agricultural Experiment 
Station Bulletin No. 1555 (1987), incorporated herein by reference. As used herein, an 
insect cell capable of expressing a polynucleotide of the present invention is 
"transformed." 

1 5 The protein of the invention may be prepared by culturing transformed host cells 

under culture conditions suitable to express the recombinant protein. The resulting 
expressed protein may then be purified from such culture (i.e., from culture medium or 
cell extracts) using known purification processes, such as gel filtration and ion exchange 
chromatography. The purification of the protein may also include an affinity column 

2 0 containing agents which will bind to the protein; one or more column steps over such 
affinity resins as concanavalin A-agarose, heparin-toyopearl® or Cibacrom blue 3GA 
Sepharose®; one or more steps involving hydrophobic interaction chromatography using 
such resins as phenyl ether, butyl ether, or propyl ether; or immunoaffinity 
chromatography. 

2 5 Alternatively, the protein of the invention may also be expressed in a form which 

will facilitate purification. For example, it may be expressed as a fusion protein, such as 
those of maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin 
(TRX). Kits for expression and purification of such fusion proteins are commercially 
available from New England BioLab (Beverly, MA), Pharmacia (Piscataway, NJ) and 

30 InVitrogen, respectively. The protein can also be tagged with an epitope and 
subsequently purified by using a specific antibody directed to such epitope. One such 
epitope ("Flag") is commercially available from Kodak (New Haven, CT). 

Finally, one or more reverse-phase high performance liquid chromatography (RP- 
HPLC) steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant 
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methyl or other aliphatic groups, can be employed to further purify the protein. Some or 
all of the foregoing purification steps, in various combinations, can also be employed to 
provide a substantially homogeneous isolated recombinant protein. The protein thus 
purified is substantially free of other mammalian proteins and is defined in accordance 
5 with the present invention as an "isolated protein." 

The protein of the invention may also be expressed as a product of transgenic 
animals, e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which 
are characterized by somatic or germ cells containing a nucleotide sequence encoding the 
protein. 

10 The protein may also be produced by known conventional chemical synthesis. 

Methods for constructing the proteins of the present invention by synthetic means are 
known to those skilled in the art. The synthetically-constructed protein sequences, by 
virtue of sharing primary, secondary or tertiary structural and/or conformational 
characteristics with proteins may possess biological properties in common therewith, 

15 including protein activity. Thus, they may be employed as biologically active or 
immunological substitutes for natural, purified proteins in screening of therapeutic 
compounds and in immunological processes for the development of antibodies. 

The proteins provided herein also include proteins characterized by amino acid 
sequences similar to those of purified proteins but into which modification are naturally 

2 0 provided or deliberately engineered. For example, modifications in the peptide or DNA 
sequences can be made by those skilled in the art using known techniques. Modifications 
of interest in the protein sequences may include the alteration, substitution, replacement, 
insertion or deletion of a selected amino acid residue in the coding sequence. For 
example, one or more of the cysteine residues may be deleted or replaced with another 

25 amino acid to alter the conformation of the molecule. Techniques for such alteration, 
substitution, replacement, insertion or deletion are well known to those skilled in the art 
(see, e.g., US. Patent No. 4,518,584). Preferably, such alteration, substitution, replacement, 
insertion or deletion retains the desired activity of the protein. 

Other fragments and derivatives of the sequences of proteins which would be 

30 expected to retain protein activity in whole or in part and may thus be useful for screening 
or other immunological methodologies may also be easily made by those skilled in the art 
given the disclosures herein. Such modifications are believed to be encompassed by the 
present invention. 
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USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit 
one or more of the uses or biological activities (including those associated with assays 
cited herein) identified below. Uses or activities described for proteins of the present 
5 invention may be provided by administration or use of such proteins or by administration 
or use of polynucleotides encoding such proteins (such as, for example, in gene therapies 
or vectors suitable for introduction of DNA). 

Research Uses and Utilities 

1 0 The polynucleotides provided by the present invention can be used by the research 

community for various purposes. The polynucleotides can be used to express 
recombinant protein for analysis, characterization or therapeutic use; as markers for 
tissues in which the corresponding protein is preferentially expressed (either 
constitutively or at a particular stage of tissue differentiation or development or in disease 

15 states); as molecular weight markers on Southern gels; as chromosome markers or tags 
(when labeled) to identify chromosomes or to map related gene positions; to compare 
with endogenous DNA sequences in patients to identify potential genetic disorders; as 
probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" 

2 0 known sequences in the process of discovering other novel polynucleotides; for selecting 

and making oligomers for attachment to a "gene chip" or other support, including for 
examination of expression patterns; to raise anti-protein antibodies using DNA 
immunization techniques; and as an antigen to raise anti-DNA antibodies or elicit another 
immune response. Where the polynucleotide encodes a protein which binds or potentially 
25 binds to another protein (such as, for example, in a receptor-ligand interaction), the 
polynucleotide can also be used in interaction trap assays (such as, for example, that 
described in Gyuris et aL, Cell 75:791-803 (1993)) to identify polynucleotides encoding the 
other protein with which binding occurs or to identify inhibitors of the binding 
interaction. 

3 0 The proteins provided by the present invention can similarly be used in assay to 

determine biological activity, including in a panel of multiple proteins for high- 
throughput screening; to raise antibodies or to elicit another immune response; as a 
reagent (including the labeled reagent) in assays designed to quantitatively determine 
levels of the protein (or its receptor) in biological fluids; as markers for tissues in which 
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the corresponding protein is preferentially expressed (either constitutively or at a 
particular stage of tissue differentiation or development or in a disease state); and, of 
course, to isolate correlative receptors or ligands. Where the protein binds or potentially 
binds to another protein (such as, for example, in a receptor-ligand interaction), the 
5 protein can be used to identify the other protein with which binding occurs or to identify 
inhibitors of the binding interaction. Proteins involved in these binding interactions can 
also be used to screen for peptide or small molecule inhibitors or agonists of the binding 
interaction. 

Any or all of these research utilities are capable of being developed into reagent 
1 0 grade or kit format for commercialization as research products. 

Methods for performing the uses listed above are well known to those skilled in 
the art. References disclosing such methods include without limitation "Molecular 
Cloning: A Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, 
J., E.F. Fritsch and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to 
1 5 Molecular Cloning Techniques", Academic Press, Berger, S.L. and A.R. Kimmel eds., 1987. 

Nutritional Uses 

Polynucleotides and proteins of the present invention can also be used as 
nutritional sources or supplements. Such uses include without limitation use as a protein 

20 or amino acid supplement, use as a carbon source, use as a nitrogen source and use as a 
source of carbohydrate. In such cases the protein or polynucleotide of the invention can 
be added to the feed of a particular organism or can be administered as a separate solid 
or liquid preparation, such as in the form of powder, pills, solutions, suspensions or 
capsules. In the case of microorganisms, the protein or polynucleotide of the invention 

2 5 can be added to the medium in or on which the microorganism is cultured. 



Cytokine and Cell Proliferation/Differentiation Activity 

A protein of the present invention may exhibit cytokine, cell proliferation (either 
3 0 inducing or inhibiting) or cell differentiation (either inducing or inhibiting) activity or may 
induce production of other cytokines in certain cell populations. Many protein factors 
discovered to date, including all known cytokines, have exhibited activity in one or more 
factor dependent cell proliferation assays, and hence the assays serve as a convenient 
confirmation of cytokine activity. The activity of a protein of the present invention is 
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evidenced by any one of a number of routine factor dependent cell proliferation assays 
for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/11, BaF3, 
MC9/G, M+ (preB M+), 2E8, RB5, DAI, 123, T1165, HT2, CTLL2, TF-1, Mo7e and CMK. 

5 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assays for T-cell or thymocyte proliferation include without limitation those 

described in: Current Protocols in Immunology, Ed by J. E. Coligan, AM. Kruisbeek, D.H. 

Margulies, E.M. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
1 0 Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 

7, Immunologic studies in Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; 

Bertagnolli et aL, J. Immunol. 145:1706-1712, 1990; Bertagnolli et al., Cellular immunology 

133:327-341, 1991; Bertagnolli, et al., J. Immunol. 149:3778-3783, 1992; Bowman et aL, J. 

Immunol. 152: 1756-1761, 1994. 
1 5 Assays for cytokine production and/ or proliferation of spleen cells, lymph node 

cells or thymocytes include, without limitation, those described in: Polyclonal T cell 

stimulation, Kruisbeek, A.M. and Shevach, EM. In Current Protocols in Immunology. J.E.e.a. 

Coligan eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and 

Measurement of mouse and human Interferon y, Schreiber, R.D. In Current Protocols in 

2 0 Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic 
cells include, without limitation, those described in: Measurement of Human and Murine 
Interleukin 2 and Interleukin 4, Bottomly, K., Davis, L.S. and lipsky, P.E. In Current 
Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, 
25 Toronto. 1991; deVries et aL, J. Exp. Med. 173:1205-1211, 1991; Moreau et aL, Nature 
336:690-692, 1988; Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; 
Measurement of mouse and human interleukin 6 - Nordan, R. In Current Protocols in 
Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.6.1-6.65, John Wiley and Sons, Toronto. 1991; 
Smith et al., Proc. Natl. Acad. Sci. U.S.A. 83:1857-1861, 1986; Measurement of human 

3 0 Interleukin 11 - Bennett, F., Giannotti, J., Clark, S.C. and Turner, K. J. In Current Protocols 

in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.15.1 John Wiley and Sons, Toronto. 1991; 
Measurement of mouse and human Interleukin 9 - Ciarletta, A., Giannotti, J., Clark, S.C. 
and Turner, K.J. In Current Protocols in Immunology. J.E.e.a. Coligan eds. Vol 1 pp. 6.13.1, 
John Wiley and Sons, Toronto. 1991. 
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Assays for T-cell clone responses to antigens (which will identify, among others, 
proteins that affect APC-T cell interactions as well as direct T-cell effects by measuring 
proliferation and cytokine production) include, without limitation, those described in: 
Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, D.H. Margulies, 
5 EM. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley-Interscience 
(Chapter 3, In Vitro assays for Mouse Lymphocyte Function; Chapter 6, Cytokines and 
their cellular receptors; Chapter 7, Immunologic studies in Humans); Weinberger et al., 
Proc. Natl. Acad. Sci. USA 77:6091-6095, 1980; Weinberger et al., Eur. J. Immun. 
11:405-411, 1981; Takai et al., J. Immunol. 1373494-3500, 1986; Takai et al., J. Immunol. 
10 140:508-512, 1988. 

Immune Stimulating or Suppressing Activity 

* 

A protein of the present invention may also exhibit immune stimulating or 
immune suppressing activity, including without limitation the activities for which assays 

15 are described herein. A protein may be useful in the treatment of various immune 
deficiencies and disorders (including severe combined immunodeficiency (SCID)), e.g., 
in regulating (up or down) growth and proliferation of T and /or B lymphocytes, as well 
as effecting the cytolytic activity of NK cells and other cell populations. These immune 
deficiencies may be genetic or be caused by viral (e.g., HIV) as well as bacterial or fungal 

20 infections, or may result from autoimmune disorders. More specifically, infectious 
diseases causes by viral, bacterial, fungal or other infection may be treatable using a 
protein of the present invention, including infections by HIV, hepatitis viruses, 
herpesviruses, mycobacteria, Leishmarda spp., malaria spp. and various fungal infections 
such as candidiasis. Of course, in this regard, a protein of the present invention may also 

25 be useful where a boost to the immune system generally may be desirable, le., in the 
treatment of cancer. 

Autoimmune disorders which may be treated using a protein of the present 
invention include, for example, connective tissue disease, multiple sclerosis, systemic 
lupus erythematosus, rheumatoid arthritis, autoimmune pulmonary inflammation, 
3 0 Guillain-Barre syndrome, autoimmune thyroiditis, insulin dependent diabetes mellitis, 
myasthenia gravis, graft-versus-host disease and autoimmune inflammatory eye disease. 
Such a protein of the present invention may also to be useful in the treatment of allergic 
reactions and conditions, such as asthma (particularly allergic asthma) or other respiratory 
problems. Other conditions, in which immune suppression is desired (including, for 
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example, organ transplantation), may also be treatable using a protein of the present 
invention. 

Using the proteins of the invention it may also be possible to immune responses, 
in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
5 immune response already in progress or may involve preventing the induction of an 
immune response. The functions of activated T cells may be inhibited by suppressing T 
cell responses or by inducing specific tolerance in T cells, or both. Immunosuppression 
of T cell responses is generally an active, non-antigen-specific, process which requires 
continuous exposure of the T cells to the suppressive agent. Tolerance, which involves 
10. inducing non-responsiveness . or anergy in T cells, is distinguishable from 
immunosuppression in that it is generally antigen-specific and persists after exposure to 
the tolerizing agent has ceased. Operationally, tolerance can be demonstrated by the lack 
of a T cell response upon reexposure to specific antigen in the absence of the tolerizing 
agent 

1 5 Down regulating or preventing one or more antigen functions (including without 

limitation B lymphocyte antigen functions (such as , for example, B7)), e.g., preventing 
high level lymphokine synthesis by activated T cells, will be useful in situations of tissue, 
skin and organ transplantation and in graft-versus-host disease (GVHD). For example, 
blockage of T cell function should result in reduced tissue destruction in tissue 

20 transplantation. Typically, in tissue transplants, rejection of the transplant is initiated 
through its recognition as foreign by T cells, followed by an immune reaction that destroys 
the transplant. The administration of a molecule which inhibits or blocks interaction of 
a B7 lymphocyte antigen with its natural ligand(s) on immune cells (such as a soluble, 
monomeric form of a peptide having B7-2 activity alone or in conjunction with a 

2 5 monomeric form of a peptide having an activity of another B lymphocyte antigen (e.g., B7- 
1, B7-3) or blocking antibody), prior to transplantation can lead to the binding of the 
molecule to the natural ligand(s) on the immune cells without transmitting the 
corresponding costimulatory signal. Blocking B lymphocyte antigen function in this 
matter prevents cytokine synthesis by immune cells, such as T cells, and thus acts as an 

30 immunosuppressant. Moreover, the lack of costimulation may also be sufficient to 
anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term 
tolerance by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated 
administration of these blocking reagents. To achieve sufficient immunosuppression or 
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tolerance in a subject, it may also be necessary to block the function of a combination of 
B lymphocyte antigens. 

The efficacy of particular blocking reagents in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
5 humans. Examples of appropriate systems which can be used include allogeneic cardiac 
grafts in rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been 
used to examine the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as 
described in Lenschow et al., Science 257:789-792 (1992) and Turka et cd. t Proc. Natl. Acad. 
Sci USA, 59:11102-11105 (1992). In addition, murine models of GVHD (see Paul ed., 
1 0 Fundamental Immunology, Raven Press, New York, 1989, pp. 846-847) can be used to 
determine the effect of blocking B lymphocyte antigen function in vivo on the development 
of that disease. 

Blocking antigen function may also be therapeutically useful for treating 
autoimmune diseases. Many autoimmune disorders are the result of inappropriate 
1 5 activation of T cells that are reactive against self tissue and which promote the production 
of cytokines and autoantibodies involved in the pathology of the diseases. Preventing the 
activation of autoreactive T cells may reduce or eliminate disease symptoms. 
Administration of reagents which block costimulation of T cells by disrupting 
receptonligand interactions of B lymphocyte antigens can be used to inhibit T cell 

2 0 activation and prevent production of autoantibodies or T cell-derived cytokines which 

may be involved in the disease process. Additionally, blocking reagents may induce 
antigen-specific tolerance of autoreactive T cells which could lead to long-term relief from 
the disease. The efficacy of blocking reagents in preventing or alleviating autoimmune 
disorders can be determined using a number of well-characterized animal models of 
25 human autoimmune diseases. Examples include murine experimental autoimmune 
encephalitis, systemic lupus erythmatosis in MRL/lpr/lpr mice or NZB hybrid mice, 
murine autoimmune collagen arthritis, diabetes mellitus in NOD mice and BB rats, and 
murine experimental myasthenia gravis (see Paul ed., Fundamental Immunology, Raven 
Press, New York, 1989, pp. 840-856). 

3 0 Upregulation of an antigen function (preferably a B lymphocyte antigen function), 

as a means of up regulating immune responses, may also be useful in therapy. 
Upregulation of immune responses may be in the form of enhancing an existing immune 
response or eliciting an initial immune response. For example, enhancing an immune 
response through stimulating B lymphocyte antigen function may be useful in cases of 
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viral infection. In addition, systemic viral diseases such as influenza, the common cold, 
and encephalitis might be alleviated by the administration of stimulatory forms of B 
lymphocyte antigens systemically. 

Alternatively, anti-viral immune responses may be enhanced in an infected patient 
5 by removing T cells from the patient, costimulating the T cells in vitro with viral antigen- 
pulsed APCs either expressing a peptide of the present invention or together with a 
stimulatory form of a soluble peptide of the present invention and reintroducing the in 
vitro activated T cells into the patient. Another method of enhancing anti-viral immune 
responses would be to isolate infected cells from a patient, transf ect them with a nucleic 

1 0 acid encoding a protein of the present invention as described herein such that the cells 
express all or a portion of the protein on their surface, and reintroduce the transfected 
cells into the patient The infected cells would now be capable of delivering a 
costimulatory signal to, and thereby activate, T cells in vivo. 

In another application, up regulation or enhancement of antigen function 

15 (preferably B lymphocyte antigen function) may be useful in the induction of tumor 
immunity. Tumor cells (e.g., sarcoma, melanoma, lymphoma, leukemia, neuroblastoma, 
carcinoma) transfected with a nucleic acid encoding at least one peptide of the present 
invention can be administered to a subject to overcome tumor-specific tolerance in the 
subject. If desired, the tumor cell can be transfected to express a combination of peptides. 

20 For example, tumor cells obtained from a patient can be transfected ex vivo with an 
expression vector directing the expression of a peptide having B7-2-like activity alone, or 
in conjunction with a peptide having B7-l-like activity and/or B7-3-like activity. The 
transfected tumor cells are returned to the patient to result in expression of the peptides 
on the surface of the transfected cell. Alternatively, gene therapy techniques can be used 

25 to target a tumor cell for transfection in vivo. 

The presence of the peptide of the present invention having the activity of a B 
lymphocyte antigen(s) on the surface of the tumor cell provides the necessary 
costimulation signal to T cells to induce a T cell mediated immune response against the 
transfected tumor cells. In addition, tumor cells which lack MHC class I or MHC class II 

3 0 molecules, or which fail to reexpress sufficient amounts of MHC class I or MHC class II 
molecules, can be transfected with nucleic acid encoding all or a portion of (e.g., a 
cytoplasmic-domain truncated portion) of an MHC class I a chain protein and P 2 
microglobulin protein or an MHC class II a chain protein and an MHC class II P chain 
protein to thereby express MHC class I or MHC class II proteins on the cell surface. 
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Expression of the appropriate class I or class II MHC in conjunction with a peptide having 
the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T cell mediated 
immune response against the transfected tumor cell. Optionally, a gene encoding an 
antisense construct which blocks expression of an MHC class II associated protein, such 
5 as the invariant chain, can also be cotransfected with a DNA encoding a peptide having 
the activity of a B lymphocyte antigen to promote presentation of tumor associated 
antigens and induce tumor specific immunity. Thus, the induction of a T cell mediated 
immune response in a human subject may be sufficient to overcome tumor-specific 
tolerance in the subject 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without 
limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, AM. 
Kruisbeek, D.H. Margulies, EM. Shevach, W Strober, Pub. Greene Publishing Associates 

1 5 and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1- 
3.19; Chapter 7, Immunologic studies in Humans); Herrmann et aL, Proc. Natl. Acad. Sci. 
USA 78:2488-2492, 1981; Herrmann et aL, J. Immunol. 128:1968-1974, 1982; Handa et aL, 
J. Immunol 135:1564-1572, 1985; Takai et al., J. Immunol 137:3494-3500, 1986; Takai et aL, 
J. Immunol 140508-512, 1988; Herrmann et al., Proc. Natl. Acad. Sci. USA 78:2488-2492, 

20 1981; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. Immunol. 
135:1564-1572, 1985; Takai et al., J. Immunol. 137:3494-3500, 1986; Bowmanet al., J. 
Virology 61:1992-1998; Takai et al., J. Immunol. 140:508-512, 1988; Bertagnolli et al., 
Cellular Immunology 133327-341, 1991; Brown et al., J. Immunol. 153:3079-3092, 1994. 
Assays for T-cell-dependent immunoglobulin responses and isotype switching 

25 (which will identify, among others, proteins that modulate T-cell dependent antibody 
responses and that affect Thl/Th2 profiles) include, without limitation, those described 
in: Maliszewski, J. Immunol. 144:3028-3033, 1990; and Assays for B cell function: In vitro 
antibody production, Mond, J.J. and Brunswick, M. In Current Protocols in Immunology. 
J.E.e.a. Coligan eds. Vol 1 pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

3 0 Mixed lymphocyte reaction (MLR) assays (which will identify, among others, 

proteins that generate predominantly Thl and CTL responses) include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A.M. Kruisbeek, 
D.H. Margulies, EM. Shevach, W Strober, Pub. Greene Publishing Associates and Wiley- 
Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 
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7, Immunologic studies in Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai 
et aL, J. Immunol. 140508-512, 1988; Bertagnolli et aL, J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins 
expressed by dendritic cells that activate naive T-cells) include, without limitation, those 
5 described in: Guery et aL, J. Immunol. 134536-544, 1995; Inaba et aL, Journal of 
Experimental Medicine 173549-559, 1991; Macatonia et aL, Journal of Immunology 
1545071-5079, 1995; Porgador et aL, Journal of Experimental Medicine 182:255-260, 1995; 
Nair et aL, Journal of Virology 67:4062-4069, 1993; Huang et aL, Science 264:961-965, 
1994; Macatonia et aL, Journal of Experimental Medicine 169:1255-1264, 1989; Bhardwaj 

10 et aL, Journal of Clinical Investigation 94:797-807, 1994; and Inaba et aL, Journal of 
Experimental Medicine 172:631-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, 
proteins that prevent apoptosis after superantigen induction and proteins that regulate 
lymphocyte homeostasis) include, without limitation, those described in: Darzynkiewicz 

15 et aL, Cytometry 13:795-808, 1992; Gorczyca et aL, Leukemia 7:659-670, 1993; Gorczyca et 
aL, Cancer Research 53:1945-1951, 1993; Itoh et aL, Cell 66:233-243, 1991; Zacharchuk, 
Journal of Immunology 145:4037-4045, 1990; Zamai et aL, Cytometry 14:891-897, 1993; 
Gorczyca et aL, International Journal of Oncology 1:639-648, 1992. 

Assays for proteins that influence early steps of T-cell commitment and 

20 development include, without limitation, those described in: Antica et aL, Blood 
84:111-117, 1994; Fine et aL, Cellular Immunology 155:111-122, 1994; Galy et aL, Blood 
85:2770-2778, 1995; Toki et aL, Proc. Nat Acad Sci. USA 88:7548-7551, 1991. 

Hematopoiesis Regulating Activity 

25 A protein of the present invention may be useful in regulation of hematopoiesis 

and, consequently, in the treatment of myeloid or lymphoid cell deficiencies. Even 
marginal biological activity in support of colony forming cells or of factor-dependent cell 
lines indicates involvement in regulating hematopoiesis, e.g. in supporting the growth and 
proliferation of erythroid progenitor cells alone or in combination with other cytokines, 

30 thereby indicating utility, for example, in treating various anemias or for use in 
conjunction with irradiation/chemotherapy to stimulate the production of erythroid 
precursors and /or erythroid cells; in supporting the growth and proliferation of myeloid 
cells such as granulocytes and monocytes/macrophages (i.e., traditional CSF activity) 
useful, for example, in conjunction with chemotherapy to prevent or treat consequent 
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myelo-suppression; in supporting the growth and proliferation of megakaryocytes and 
consequently of platelets thereby allowing prevention or treatment of various platelet 
disorders such as thrombocytopenia, and generally for use in place of or complimentary 
to platelet transfusions; and /or in supporting the growth and proliferation of 
5 hematopoietic stem cells which are capable of maturing to any and all of the above- 
mentioned hematopoietic cells and therefore find therapeutic utility in various stem cell 
disorders (such as those usually treated with transplantation, including, without 
limitation, aplastic anemia and paroxysmal nocturnal hemoglobinuria), as well as in 
repopulating the stem cell compartment post irradiation/chemotherapy, either in-vivo or 

10 ex-vivo (i.e., in conjunction with bone marrow transplantation or with peripheral 
progenitor cell transplantation (homologous or heterologous)) as normal cells or 
genetically manipulated for gene therapy. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

1 5 Suitable assays for proliferation and differentiation of various hematopoietic lines 

are cited above. 

Assays for embryonic stem cell differentiation (which will identify, among others, 
proteins that influence embryonic differentiation hematopoiesis) include, without 
limitation, those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et 
20 al., Molecular and Cellular Biology 13:473-486, 1993; McClanahan et al., Blood 
81:2903-2915, 1993. 

Assays for stem cell survival and differentiation (which will identify, among 
others, proteins that regulate lympho-hematopoiesis) include, without limitation, those 
described in: Methylcellulose colony forming assays, Freshney, M.G. In Culture of 

2 5 Hematopoietic Cells. R.L Freshney, et al. eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, 

NY. 1994; Hirayama et al., Proc Natl. Acad. Sci. USA 895907-5911, 1992; Primitive 
hematopoietic colony forming cells with high proliferative potential, McNiece, I JC. and 
Briddell, R.A. In Culture of Hematopoietic Cells. R.I. Freshney, et al. eds. Vol pp. 23-39, 
Wiley-Liss, Inc., New York, NY. 1994; Neben et al., Experimental Hematology 22:353-359, 

3 0 1994; Cobblestone area forming cell assay, Ploemacher, R.E. In Culture of Hematopoietic 

Cells. R.I. Freshney, et al eds. Vol pp. 1-21, Wiley-Liss, Inc., New York, NY. 1994; Long 
term bone marrow cultures in the presence of stromal cells, Spooncer, E., Dexter, M. and 
Allen, T. In Culture of Hematopoietic Cells. ILL Freshney, et al. eds. Vol pp. 163-179, 
Wiley-Liss, Inc., New York, NY. 1994; Long term culture initiating cell assay, Sutherland, 
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HJ. In Culture of Hematopoietic Cells. RJ. Freshney, et (d. eds. Vol pp. 139-162, Wiley-Liss, 
Inc., New York, NY. 1994. 

Tissue Growth Activity 
5 A protein of the present invention also may have utility in compositions used for 

bone, cartilage, tendon, ligament and/or nerve tissue growth or regeneration, as well as 
for wound healing and tissue repair and replacement, and in the treatment of burns, 
incisions and ulcers. 

A protein of the present invention, which induces cartilage and/ or bone growth 
10 in circumstances where bone is not normally formed, has application in the healing of 
bone fractures and cartilage damage or defects in humans and other animals. Such a 
preparation employing a protein of the invention may have prophylactic use in closed as 
well as open fracture reduction and also in the improved fixation of artificial joints. De 
novo bone formation induced by an osteogenic agent contributes to the repair of 
1 5 congenital, trauma induced, or oncologic resection induced craniofacial defects, and also 
is useful in cosmetic plastic surgery. 

A protein of this invention may also be used in the treatment of periodontal 
disease, and in other tooth repair processes. Such agents may provide an environment 
to attract bone-forming cells, stimulate growth of bone-forming cells or induce 

2 0 differentiation of progenitors of bone-forming cells. A protein of the invention may also 

be useful in the treatment of osteoporosis or osteoarthritis, such as through stimulation 
of bone and/or cartilage repair or by blocking inflammation or processes of tissue 
destruction (collagenase activity, osteoclast activity, etc.) mediated by inflammatory 
processes. 

25 Another category of tissue regeneration activity that may be attributable to the 

protein of the present invention is tendon/ligament formation. A protein of the present 
invention, which induces tendon/ligament-like tissue or other tissue formation in 
circumstances where such tissue is not normally formed, has application in the healing of 
tendon or ligament tears, deformities and other tendon or ligament defects in humans and 

3 0 other animals. Such a preparation employing a tendon/ligament-like tissue inducing 

protein may have prophylactic use in preventing damage to tendon or ligament tissue, as 
well as use in the improved fixation of tendon or ligament to bone or other tissues, and 
in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue 
formation induced by a composition of the present invention contributes to the repair of 
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. congenital, trauma induced, or other tendon or ligament defects of other origin, and is also 
useful in cosmetic plastic surgery for attachment or repair of tendons or ligaments. The 
compositions of the present invention may provide an environment to attract tendon- or 
ligament-forming cells, stimulate growth of tendon- or ligament-forming cells, induce 
5 differentiation of progenitors of tendon- or ligament-forming cells, or induce growth of 
tendon/ligament cells or progenitors ex vivo for return in vivo to effect tissue repair. The 
compositions of the invention may also be useful in the treatment of tendinitis, carpal 
tunnel syndrome and other tendon or ligament defects. The compositions may also 
include an appropriate matrix and/or sequestering agent as a carrier as is well known in 
10 the art. 

The protein of the present invention may also be useful for proliferation of neural 
cells and for regeneration of nerve and brain tissue, ix. for the treatment of central and 
peripheral nervous system diseases and neuropathies, as well as mechanical and 
traumatic disorders, which involve degeneration, death or trauma to neural cells or nerve 

15 tissue. More specifically, a protein may be used in the treatment of diseases of the 
peripheral nervous system, such as peripheral nerve injuries, peripheral neuropathy and 
localized neuropathies, and central nervous system diseases, such as Alzheimer's, 
Parkinson's disease, Huntington's disease, amyotrophic lateral sclerosis, and Shy-Drager 
syndrome. Further conditions which may be treated in accordance with the present 

2 0 invention include mechanical and traumatic disorders, such as spinal cord disorders, head 
trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies resulting 
from chemotherapy or other medical therapies may also be treatable using a protein of the 
invention. 

Proteins of the invention may also be useful to promote better or faster closure of 

2 5 non-healing wounds, including without limitation pressure ulcers, ulcers associated with 

vascular insufficiency, surgical and traumatic wounds, and the like. 

It is expected that a protein of the present invention may also exhibit activity for 
generation or regeneration of other tissues, such as organs (including, for example, 
pancreas, liver, intestine, kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) 

3 0 and vascular (including vascular endothelium) tissue, or for promoting the growth of cells 

comprising such tissues. Part of the desired effects may be by inhibition or modulation 
of fibrotic scarring to allow normal tissue to regenerate. A protein of the invention may 
also exhibit angiogenic activity. 
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A protein of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, 
and conditions resulting from systemic cytokine damage. 

A protein of the present invention may also be useful for promoting or inhibiting 
5 differentiation of tissues described above from precursor tissues or cells; or for inhibiting 
the growth of tissues described above. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for tissue generation activity include, without limitation, those described 
10 in: International Patent Publication No. WO95/16035 (bone, cartilage, tendons- 
International Patent Publication No. WO95/05846 (nerve, neuronal); International Patent 
Publication No. WO91/07491 (skin, endothelium ). 

Assays for wound healing activity include, without limitation, those described in: 
Winter, Epidermal Wound Healing, pps. 71-112 (Maibach, HI and Rovee, DT, eds.), Year 
15 Book Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. 
Dermatol 71:382-84 (1978). 



Activin/Inhibin Activity 

A protein of the present invention may also exhibit activin- or inhibin-related 
2 0 activities. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to 
stimulate the release of follicle stimulating hormone (FSH). Thus, a protein of the present 
invention, alone or in heterodimers with a member of the inhibin a family, may be useful 
as a contraceptive based on the ability of inhibins to decrease fertility in female mammals 

2 5 and decrease spermatogenesis in male mammals. Administration of sufficient amounts 

of other inhibins can induce infertility in these mammals. Alternatively, the protein of the 
invention, as a homodimer or as a heterodimer with other protein subunits of the inhibin- 
P group, may be useful as a fertility inducing therapeutic, based upon the ability of activin 
molecules in stimulating FSH release from cells of the anterior pituitary. See, for example, 

3 0 United States Patent 4,798,885. A protein of the invention may also be useful for 

advancement of the onset of fertility in sexually immature mammals, so as to increase the 
lifetime reproductive performance of domestic animals such as cows, sheep and pigs. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 
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Assays for activin/inhibin activity include, without limitation, those described in: 
Vale et al., Endocrinology 91562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et 
al., Nature 321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et al., Proc. 
Nad. Acad. Sci. USA 83:3091-3095, 1986. 

5 

Chemotactic/ Chemokinetic Activity 

A protein of the present invention may have chemotactic or chemokinetic activity 
(e.g., act as a chemokine) for mammalian cells, including, for example, monocytes, 
fibroblasts, neutrophils, T-cells, mast cells, eosinophils, epithelial and/ or endothelial cells. 

1 0 Chemotactic and chemokinetic proteins can be used to mobilize or attract a desired cell 
population to a desired site of action. Chemotactic or chemokinetic proteins provide 
particular advantages in treatment of wounds and other trauma to tissues, as well as in 
treatment of localized infections. For example, attraction of lymphocytes, monocytes or 
neutrophils to tumors or sites of infection may result in improved immune responses 

1 5 against the tumor or infecting agent 

A protein or peptide has chemotactic activity for a particular cell population if it 
can stimulate, directly or indirectly, the directed orientation or movement of such cell 
population. Preferably, the protein or peptide has the ability to directly stimulate directed 
movement of cells. Whether a particular protein has chemotactic activity for a population 

20 of cells can be readily determined by employing such protein or peptide in any known 

> 

assay for cell chemotaxis. 

The activity of a protein of the invention may, among other means, be measured 
by the following methods: 

Assays for chemotactic activity (which will identify proteins that induce or prevent 

2 5 chemotaxis) consist of assays that measure the ability of a protein to induce the migration 

of cells across a membrane as well as the ability of a protein to induce the adhesion of one 
cell population to another cell population. Suitable assays for movement and adhesion 
include, without limitation, those described in: Current Protocols in Immunology, Ed by 
J.E. Coligan, A.M. Kruisbeek, D.H. Margulies, E.M. Shevach, W.Strober, Pub. Greene 

3 0 Publishing Associates and Wiley-Interscience (Chapter 6.12, Measurement of alpha and 

beta Chemokines 6.12.1-6.12^8; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. 
APMIS 103:140-146, 1995; Muller et al Eur. J. Immunol. 25: 1744-1748; Gruber et al. J. of 
Immunol. 1525860-5867, 1994; Johnston et al. J. of Immunol. 153: 1762-1768, 1994. 
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Hemostatic and Thrombolytic Activity 

A protein of the invention may also exhibit hemostatic or thrombolytic activity. 
As a result, such a protein is expected to be useful in treatment of various coagulation 
disorders (including hereditary disorders, such as hemophilias) or to enhance coagulation 
5 and other hemostatic events in treating wounds resulting from trauma, surgery or other 
causes. A protein of the invention may also be useful for dissolving or inhibiting 
formation of thromboses and for treatment and prevention of conditions resulting 
therefrom (such as, for example, infarction of cardiac and central nervous system vessels 
(e.g., stroke). 

1 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Assay for hemostatic and thrombolytic activity include, without limitation, those 
described in: Linet et al., J. din. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis 
Res. 45:41^419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

15 35:467-474, 1988. 



Receptor/Ligand Activity 

A protein of the present invention may also demonstrate activity as receptors, 
receptor ligands or inhibitors or agonists of receptor/ligand interactions. Examples of 

20 such receptors and ligands include, without limitation, cytokine receptors and their 
ligands, receptor kinases and their ligands, receptor phosphatases and their ligands, 
receptors involved in cell-cell interactions and their ligands (including without limitation, 
cellular adhesion molecules (such as selectins, integrins and their ligands) and 
receptor/ligand pairs involved in antigen presentation, antigen recognition and 

25 development of cellular and humoral immune responses). Receptors and ligands are also 
useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction. A protein of the present invention (including, without 
limitation, fragments of receptors and ligands) may themselves be useful as inhibitors of 
receptor/ligand interactions. 

3 0 The activity of a protein of the invention may, among other means, be measured 

by the following methods: 

Suitable assays for receptor-ligand activity include without limitation those 
described iruCurrent Protocols in Immunology, Ed by J.E. Coligan, A.M. Kruisbeek, D.H. 
Margulies, EM. Shevach, W.Strober, Pub. Greene Publishing Associates and 
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Wiley-Interscience (Chapter 7.28, Measurement of Cellular Adhesion under static 
conditions 728A-7282Z), Takai et al., Proc. Natl. Acad. Sci. USA 84:6864-6868, 1987; 
BiereretaL, J. Exp. Med. 168:1145-1156, 1988; Rosenstein etal., J. Exp. Med. 169:149-160 
1989; Stoltenborg et al., J. Immunol. Methods 17559-68, 1994; Stitt et al v Cell 80:661-670, 
5 1995. 



Anti-Inflammatory Activity 

Proteins of the present invention may also exhibit anti-inflammatory activity. The 
anti-inflammatory activity may be achieved by providing a stimulus to cells involved in 

10 the inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for 
example, cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the 
inflammatory process, inhibiting or promoting cell extravasation, or by stimulating or 
suppressing production of other factors which more directly inhibit or promote an 
inflammatory response. Proteins exhibiting such activities can be used to treat 

15 inflammatory conditions including chronic or acute conditions), including without 
limitation inflammation associated with infection (such as septic shock, sepsis or systemic 
inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, endotoxin 
lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 
chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting 

2 0 from over production of cytokines such as TNF or IL-1. Proteins of the invention may also 
be useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 

Cadherin /Tumor Invasion Suppressor Activity 

Cadherins are calcium-dependent adhesion molecules that appear to play major 
roles during development, particularly in defining specific cell types. Loss or alteration 
of normal cadherin expression can lead to changes in cell adhesion properties linked to 
tumor growth and metastasis. Cadherin malfunction is also implicated in other human 
diseases, such as pemphigus vulgaris and pemphigus foliaceus (auto-immune blistering 
skin diseases), Crohn's disease, and some developmental abnormalities. 

The cadherin superfamily includes well over forty members, each with a distinct 
pattern of expression. All members of the superfamily have in common conserved 
extracellular repeats (cadherin domains), but structural differences are found in other 
parts of the molecule. The cadherin domains bind calcium to form their tertiary structure 
and thus calcium is required to mediate their adhesion. Only a few amino acids in the 
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first cadherin domain provide the basis for homophilic adhesion; modification of this 
recognition site can change the specificity of a cadherin so that instead of recognizing only 
itself, the mutant molecule can now also bind to a different cadherin. In addition, some 
cadherins engage in heterophils adhesion with other cadherins. 
5 E-cadherin, one member of the cadherin superfamily, is expressed in epithelial cell 

types. Pathologically, if E-cadherin expression is lost in a tumor, the malignant cells 
become invasive and the cancer metastasizes. Transfection of cancer cell lines with 
polynucleotides expressing E-cadherin has reversed cancer-associated changes by 
returning altered cell shapes to normal, restoring cells' adhesiveness to each other and to 

10 their substrate, decreasing the cell growth rate, and drastically reducing anchorage- 
independent cell growth. Thus, reintroducing E-cadherin expression reverts carcinomas 
to a less advanced stage. It is likely that other cadherins have the same invasion 
suppressor role in carcinomas derived from other tissue types. Therefore, proteins of the 
present invention with cadherin activity, and polynucleotides of the present invention 

15 encoding such proteins, can be used to treat cancer. Introducing such proteins or 
polynucleotides into cancer cells can reduce or eliminate the cancerous changes observed 
in these cells by providing normal cadherin expression. 

Cancer cells have also been shown to express cadherins of a different tissue type 
than their origin, thus allowing these cells to invade and metastasize in a different tissue 

20 in the body. Proteins of the present invention with cadherin activity, and polynucleotides 
of the present invention encoding such proteins, can be substituted in these cells for the 
inappropriately expressed cadherins, restoring normal cell adhesive properties and 
reducing or eliminating the tendency of the cells to metastasize. 

Additionally, proteins of the present invention with cadherin activity, and 

25 polynucleotides of the present invention encoding such proteins, can used to generate 
antibodies recognizing and binding to cadherins. Such antibodies can be used to block 
the adhesion of inappropriately expressed tumor-cell cadherins, preventing the cells from 
forming a tumor elsewhere. Such an anti-cadherin antibody can also be used as a marker 
for the grade, pathological type, and prognosis of a cancer, i.e. the more progressed the 

3 0 cancer, the less cadherin expression there will be, and this decrease in cadherin expression 
can be detected by the use of a cadherin-binding antibody. 

Fragments of proteins of the present invention with cadherin activity, preferably 
a polypeptide comprising a decapeptide of the cadherin recognition site, and poly- 
nucleotides of the present invention encoding such protein fragments, can also be used 

52 



WO 98/25962 



PCT/US97/23224 



to block cadherin function by binding to cadherins and preventing them from binding in 
ways that produce undesirable effects. Additionally, fragments of proteins of the present 
invention with cadherin activity, preferably truncated soluble cadherin fragments which 
have been found to be stable in the circulation of cancer patients, and polynucleotides 
5 encoding such protein fragments, can be used to disturb proper cell-cell adhesion. 

Assays for cadherin adhesive and invasive suppressor activity include, without 
limitation, those described in: Hortsch et al. J Biol Chem 270 (32): 18809-18817, 1995; 
Miyaki et al. Oncogene 11: 2547-2552, 1995; Ozawa et al. Cell 63: 1033-1038, 1990. 

10 Tumor Inhibition Activity 

In addition to the activities described above for immunological treatment or 
prevention of tumors, a protein of the invention may exhibit other anti-tumor activities. 
A protein may inhibit tumor growth directly or indirectly (such as, for example, via 
ADCQ. A protein may exhibit its tumor inhibitory activity by acting on tumor tissue or 

15 tumor precursor tissue, by inhibiting formation of tissues necessary to support tumor 
growth (such as, for example, by inhibiting angiogenesis), by causing production of other 
factors, agents or cell types which inhibit tumor growth, or by suppressing, eliminating 
or inhibiting factors, agents or cell types which promote tumor growth. 

20 Other Activities 

A protein of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious 
agents, including, without limitation, bacteria, viruses, fungi and other parasites; effecting 
(suppressing or enhancing) bodily characteristics, including, without limitation, height, 

2 5 weight, hair color, eye color, skin, fat to lean ratio or other tissue pigmentation, or organ 

or body part size or shape (such as, for example, breast augmentation or diminution, 
change in bone form or shape); effecting biorhythms or caricadic cycles or rhythms; 
effecting the fertility of male or female subjects; effecting the metabolism, catabolism, 
anabolism, processing, utilization, storage or elimination of dietary fat, lipid, protein, 

3 0 carbohydrate, vitamins, minerals, cofactors or other nutritional factors or components); 

effecting behavioral characteristics, including, without limitation, appetite, libido, stress, 
cognition (including cognitive disorders), depression (including depressive disorders) and 
violent behaviors; providing analgesic effects or other pain reducing effects; promoting 
differentiation and growth of embryonic stem cells in lineages other than hematopoietic 
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lineages; hormonal or endocrine activity; in the case of enzymes, correcting deficiencies 
of the enzyme and treating deficiency-related diseases; treatment of hyperproliferative 
disorders (such as, for example, psoriasis); immunoglobulin-like activity (such as, for 
example, the ability to bind antigens or complement); and the ability to act as an antigen 
5 in a vaccine composition to raise an immune response against such protein or another 
material or entity which is cross-reactive with such protein. 

ADMINISTRATION AND DOSING 

A protein of the present invention (from whatever source derived, including 

10 without limitation from recombinant and non-recombinant sources) may be used in a 
pharmaceutical composition when combined with a pharmaceutically acceptable carrier. 
Such a composition may also contain (in addition to protein and a carrier) diluents, fillers, 
salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 

1 5 effectiveness of the biological activity of the active ingredients). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of 
the invention may also contain cytokines, lymphokines, or other hematopoietic factors 
such as M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL4, IL-5, IL-6, IL-7, 11^8, IL-9, ILrlO, ILrll, 
IL-12, EL-13, 11^14, IL-15, IFN, TNFO, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem 

2 0 cell factor, and erythropoietin. The pharmaceutical composition may further contain other 
agents which either enhance the activity of the protein or compliment its activity or use 
in treatment. Such additional factors and/or agents may be included in the 
pharmaceutical composition to produce a synergistic effect with protein of the invention, 
or to minimize side effects. Conversely, protein of the present invention may be included 

25 in formulations of the particular cytokine, lymphokine, other hematopoietic factor, 
thrombolytic or anti-thrombotic factor, or anti-inflammatory agent to minimize side effects 
of the cytokine, lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic 
factor, or anti-inflammatory agent 

A protein of the present invention may be active in multimers (e.g., heterodimers 

30 or homodimers) or complexes with itself or other proteins. As a result, pharmaceutical 
compositions of the invention may comprise a protein of the invention in such multimeric 
or complexed form. 

The pharmaceutical composition of the invention may be in the form of a complex 
of the protein(s) of present invention along with protein or peptide antigens. The protein 
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and /or peptide antigen will deliver a stimulatory signal to both B and T lymphocytes. B 
lymphocytes will respond to antigen through their surface immunoglobulin receptor. T 
lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins 
5 including those encoded by class I and class II MHC genes on host cells will serve to 
present the peptide antigen(s) to T lymphocytes. The antigen components could also be 
supplied as purified MHC-peptide complexes alone or with co-stimulatory molecules that 
can directly signal T cells. Alternatively antibodies able to bind surface immunolgobulin 
and other molecules on B cells as well as antibodies able to bind the TCR and other 
10 molecules on T cells can be combined with the pharmaceutical composition of the 
invention. 

The pharmaceutical composition of the invention may be in the form of a liposome 
in which protein of the present invention is combined, in addition to other 
pharmaceutical^ acceptable carriers, with amphipathic agents such as lipids which exist 

15 in aggregated form as micelles, insoluble monolayers, liquid crystals, or lamellar layers 
in aqueous solution. Suitable lipids for liposomal formulation include, without limitation, 
monoglycerides, diglycerides, sulfatides, lysolecithin, phospholipids, saponin, bile acids, 
and the like. Preparation of such liposomal formulations is within the level of skill in the 
art, as disclosed, for example, in U.S. Patent No. 4,235,871; US. Patent No. 4,501,728; U.S. 

2 0 Patent No. 4,837,028; and U.S. Patent No. 4,737,323, all of which are incorporated herein 
by reference. 

As used herein, the term "therapeutically effective amount" means the total 
amount of each active component of the pharmaceutical composition or method that is 
sufficient to show a meaningful patient benefit, i.e., treatment, healing, prevention or 
2 5 amelioration of the relevant medical condition, or an increase in rate of treatment, healing, 
prevention or amelioration of such conditions. When applied to an individual active 
ingredient, administered alone, the term refers to that ingredient alone. When applied to 

a combination, the term refers to combined amounts of the active ingredients that result 

> 

in the therapeutic effect, whether administered in combination, serially or simultaneously. 
30 In practicing the method of treatment or use of the present invention, a 

therapeutically effective amount of protein of the present invention is administered to a 
mammal having a condition to be treated. Protein of the present invention may be 
administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines 
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or other hematopoietic factors. When co-administered with one or more cytokines, 
lymphokines or other hematopoietic factors, protein of the present invention may be 
administered either simultaneously with the cytokine(s) / lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If 
5 administered sequentially, the attending physician will decide on the appropriate 
sequence of administering protein of the present invention in combination with 
cytokine^), lymphokine(s), other hematopoietic factors), thrombolytic or anti-thrombotic 
factors. 

Administration of protein of the present invention used in the pharmaceutical 
10 composition or to practice the method of the present invention can be carried out in a 
variety of conventional ways, such as oral ingestion, inhalation, topical application or 
cutaneous, subcutaneous, intraperitoneal, parenteral or intravenous injection. 
Intravenous administration to the patient is preferred. 

When a therapeutically effective amount of protein of the present invention is 
15 administered orally, protein of the present invention will be in the form of a tablet, 
capsule, powder, solution or elixir. When administered in tablet form, the pharmaceutical 
composition of the invention may additionally contain a solid carrier such as a gelatin or 
an adjuvant The tablet, capsule, and powder contain from about 5 to 95% protein of the 
present invention, and preferably from about 25 to 90% protein of the present invention. 
2 0 When administered in liquid form, a liquid carrier such as water, petroleum, oils of animal 
or plant origin such as peanut oil, mineral oil, soybean oil, or sesame oil, or synthetic oils 
may be added. The liquid form of the pharmaceutical composition may further contain 
physiological saline solution, dextrose or other saccharide solution, or glycols such as 
ethylene glycol, propylene glycol or polyethylene glycol. When administered in liquid 

2 5 form, the pharmaceutical composition contains from about 0.5 to 90% by weight of protein 

of the present invention, and preferably from about 1 to 50% protein of the present 
invention. 

When a therapeutically effective amount of protein of the present invention is 
administered by intravenous, cutaneous or subcutaneous injection, protein of the present 

3 0 invention will be in the form of a pyrogen-free, parenterally acceptable aqueous solution. 

The preparation of such parenterally acceptable protein solutions, having due regard to 
pH, isotonicity, stability, and the like, is within the skill in the art. A preferred 
pharmaceutical composition for intravenous, cutaneous, or subcutaneous injection should 
contain, in addition to protein of the present invention, an isotonic vehicle such as Sodium 
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Chloride Injection, Ringer's Injection, Dextrose Injection, Dextrose and Sodium Chloride 
Injection, La eta ted Ringer's Injection, or other vehicle as known in the art. The 
pharmaceutical composition of the present invention may also contain stabilizers, 
preservatives, buffers, antioxidants, or other additives known to those of skill in the art. 
5 The amount of protein of the present invention in the pharmaceutical composition 

of the present invention will depend upon the nature and severity of the condition being 
treated, and on the nature of prior treatments which the patient has undergone. 
Ultimately, the attending physician will decide the amount of protein of the present 
invention with which to treat each individual patient. Initially, the attending physician 

10 will administer low doses of protein of the present invention and observe the patient's 
response. Larger doses of protein of the present invention may be administered until the 
optimal therapeutic effect is obtained for the patient, and at that point the dosage is not 
increased further. It is contemplated that the various pharmaceutical compositions used 
to practice the method of the present invention should contain about 0.01 pg to about 100 

15 mg (preferably about O.lng to about 10 mg, more preferably about 0.1 pg to about 1 mg) 
of protein of the present invention per kg body weight 

The duration of intravenous therapy using the pharmaceutical composition of the 
present invention will vary, depending on the severity of the disease being treated and 
the condition and potential idiosyncratic response of each individual patient. It is 

2 0 contemplated that the duration of each application of the protein of the present invention 

will be in the range of 12 to 24 hours of continuous intravenous administration. 
Ultimately the attending physician will decide on the appropriate duration of intravenous 
therapy using the pharmaceutical composition of the present invention. 

Protein of the invention may also be used to immunize animals to obtain 
25 polyclonal and monoclonal antibodies which specifically react with the protein. Such 
antibodies may be obtained using either the entire protein or fragments thereof as an 
immunogen. The peptide immunogens additionally may contain a cysteine residue at the 
carboxyl terminus, and are conjugated to a hapten such as keyhole limpet hemocyanin 
(KLH). Methods for synthesizing such peptides are known in the art, for example, as in 

3 0 R.P. Merrifield, J. Amer.Chem.Soc. 85, 2149-2154 (1963); J.L. Krstenansky, et al, FEBS Lett. 

211, 10 (1987). Monoclonal antibodies binding to the protein of the invention may be 
useful diagnostic agents for the immunodetection of the protein. Neutralizing monoclonal 
antibodies binding to the protein may also be useful therapeutics for both conditions 
associated with the protein and also in the treatment of some forms of cancer where 
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abnormal expression of the protein is involved. In the case of cancerous cells or leukemic 
cells, neutralizing monoclonal antibodies against the protein may be useful in detecting 
and preventing the metastatic spread of the cancerous cells, which may be mediated by 
the protein. 

5 For compositions of the present invention which are useful for bone, cartilage, 

tendon or ligament regeneration, the therapeutic method includes administering the 
composition topically, systematically, or locally as an implant or device. When 
administered, the therapeutic composition for use in this invention is, of course, in a 
pyrogen-free, physiologically acceptable form. Further, the composition may desirably 

10 be encapsulated or injected in a viscous form for delivery to the site of bone, cartilage or 
tissue damage. Topical administration may be suitable for wound healing and tissue 
repair. Therapeutically useful agents other than a protein of the invention which may also 
optionally be included in the composition as described above, may alternatively or 
additionally, be administered simultaneously or sequentially with the composition in the 

15 methods of the invention. Preferably for bone and/or cartilage formation, the 
composition would include a matrix capable of delivering the protein-containing 
composition to the site of bone and/or cartilage damage, providing a structure for the 
developing bone and cartilage and optimally capable of being resorbed into the body. 
Such matrices may be formed of materials presently in use for other implanted medical 

20 applications. 

The choice of matrix material is based on biocompatibility, biodegradability, 
mechanical properties, cosmetic appearance and interface properties. The particular 
application of the compositions will define the appropriate formulation. Potential 
matrices for the compositions may be biodegradable and chemically defined calcium 

25 sulfate, tricalciumphosphate, hydroxyapatite, polylactic acid, polyglycolic acid and 
polyanhydrides. Other potential materials are biodegradable and biologically well- 
defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins 
or extracellular matrix components. Other potential matrices are nonbiodegradable and 
chemically defined, such as sintered hydroxapatite, bioglass, aluminates, or other 

3 0 ceramics. Matrices may be comprised of combinations of any of the above mentioned 
types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalciumphosphate. The bioceramics may be altered in composition, such as in calcium- 
aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. 
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Presently preferred is a 5050 (mole weight) copolymer of lactic acid and glycolic 
acid in the form of porous particles having diameters ranging from 150 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as 
carboxymethyl cellulose or autologous blood dot, to prevent the protein compositions 
5 from disassociating from the matrix. 

A preferred family of sequestering agents is cellulosic materials such as 
alkylcelluloses (including hydroxy alkylcelluloses), including methylcellulose, 
ethylcellulose, hydroxyethylcellulose, hydroxypropylcellulose, hydroxypropyl- 
methylcellulose, and carboxymethylcellulose, the most preferred being cationic salts of 

1 0 carboxymethylcellulose (CMC). Other preferred sequestering agents include hyaluronic 
acid, sodium alginate, poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl 
polymer and poly(vinyl alcohol). The amount of sequestering agent useful herein is 05-20 
wt%, preferably 1-10 wt% based on total formulation weight, which represents the 
amount necessary to prevent desorbtion of the protein from the polymer matrix and to 

1 5 provide appropriate handling of the composition, yet not so much that the progenitor cells 
are prevented from infiltrating the matrix, thereby providing the protein the opportunity 
to assist the osteogenic activity of the progenitor cells. 

In further compositions, proteins of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 

2 0 question. These agents include various growth factors such as epidermal growth factor 
(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-cc and 
TGF-P), and insulin-like growth factor (IGF)* 

The therapeutic compositions are also presently valuable for veterinary 
applications. Particularly domestic animals and thoroughbred horses, in addition to 

2 5 humans, are desired patients for such treatment with proteins of the present invention. 

The dosage regimen of a protein-containing pharmaceutical composition to be 
used in tissue regeneration will be determined by the attending physician considering 
various factors which modify the action of the proteins, e.g., amount of tissue weight 
desired to be formed, the site of damage, the condition of the damaged tissue, the size of 

30 a wound, type of damaged tissue (e.g., bone), the patient's age, sex, and diet, the severity 
of any infection, time of ad^riinistration and other clinical factors. The dosage may vary 
with the type of matrix used in the reconstitution and with inclusion of other proteins in 
the pharmaceutical composition. For example, the addition of other known growth 
factors, such as IGF I (insulin like growth factor I), to the final composition, may also effect 
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the dosage. Progress can be monitored by periodic assessment of tissue/bone growth 
and/ or repair, for example, X-rays, histomorphometric determinations and tetracycline 
labeling. 

Polynucleotides of the present invention can also be used for gene therapy . Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other 
known methods for introduction of nucleic acid into a cell or organism (including, without 
limitation, in the form of viral vectors or naked DNA). 

Cells may also be cultured ex vivo in the presence of proteins of the present 
invention in order to proliferate or to produce a desired effect on or activity in such cells. 
Treated cells can thai be introduced in vivo for therapeutic purposes. 

Patent and literature references cited herein are incorporated by reference as if 
fully set forth. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Jacobs, Kenneth 

McCoy, John M. 
LaVallie # Edward R. 
Racie, Lisa A. 
Merberg , David 
Treacy, Maurice 
Spaulding , Vikki 
Agostino, Michael J. 

(ii) TITLE OF INVENTION: SECRETED PROTEINS AND POLYNUCLEOTIDES 
ENCODING THEM 

(iii) NUMBER OF SEQUENCES: 32 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: 87 Cambridge Park Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: U.S.A. 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 
<C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 

(A) NAME: Sprunger, Suzanne A. 

(B) REGISTRATION NUMBER: 41,323 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 498-8284 

(B) TELEFAX: (617) 876-5851 



(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 



PCT/DS97/23224 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

GTGGAAGGAG TGGATAATAA AATGAGTCAG TGCACCAGCT CCACCATTCC TAGCTCAAGT 60 

CAAGAGAAAG ACCCTAAAAT TAAGACAGAG ACAAGTGAAG AGGGATCTGG AGACTTGGAT 120 

AATCTAGATG CTATTCTTGG TGATCTGACT AGTTCTGACT TTTACAATAA TTCCATATCC 180 

TCAAATGGTA GTCATCTGGG GACTAAGCAA CAGGTGTTTC AAGGAACTAA TTCTCTGGGT 240 

TTGAAAAGTT CACAGTCTGT GCAGTCTATT CGTCCTCCAT ATAACCGAGC AGTGTCTCTG 300 

GATAGCCCTG TTTCTGTTGG CTCAAGTCCT CCAGTAAAAA ATATCAGTGC TTTCCCCATG 360 

TTACCAAAGC AACCCATGTT GGGTGGGAAT CCAAGAATGA TGGATAGTCA RGAAAATTAT 420 

GGCTCAAGTA TGGGAGACTG GGGCTTACCA AACTCAAAGG CC 462 
(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 147 amino acids 

(B) TYPE: amino acid 

( C ) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ser Gin Cys Thr Ser Ser Thr lie Pro Ser Ser Ser Gin Glu Lys 
15 10 15 

Asp Pro Lys lie Lys Thr Glu Thr Ser Glu Glu Gly Ser Gly Asp Leu 

20 25 30 

Asp Asn Leu Asp Ala lie Leu Gly Asp Leu Thr Ser Ser Asp Phe Tyr 
35 40 45 

Asn Asn Ser lie Ser Ser Asn Gly Ser His Leu Gly Thr Lys Gin Gin 

50 55 60 

Val Phe Gin Gly Thr Asn Ser Leu Gly Leu Lys Ser Ser Gin Ser Val 
65 70 75 80 

Gin Ser lie Arg Pro Pro Tyr Asn Arg Ala Val Ser Leu Asp Ser Pro 

85 90 95 
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Val Ser Val Gly Ser Ser Pro Pro Val Lys Asn lie Ser Ala Phe Pro 

100 105 110 

Met Leu Pro Lys Gin Pro Met Leu Gly Gly Asn Pro Arg Met Met Asp 
115 120 125 

Ser Gin Glu Asn Tyr Gly Ser Ser Met Gly Asp Trp Gly Leu Pro Asn 
130 135 140 

Ser Lys Ala 
145- 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 60 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAA 119 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3316 base pairs 

(B) TYPE: nucleic acid 

(C ) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

GGTAAGATGG CGGCTGTGAG TCTGCGGCTC GGCGACTTGG TGTGGGGGAA ACTCGGCCGA 60 

TATCCTCCTT GGCCAGGAAA GATTGTTAAT CCACCAAAGG ACTTGAAGAA ACCTCGCGGA 120 

AAGAAATGCT TCTTTGTGAA ATTTTTTGGA ACAGAAGATC ATGCCTGGAT CAAAGTGGAA 180 

CAGCTGAAGC CATATCATGC TCATAAAGAG GAAATGATAA AAATTAACAA GGGTAAACGA 240 

TTCCAGCAAG CGGTAGATGC TGTCGAAGAG TTCCTCAGGA GAGCCAAAGG GAAAGACCAG 300 
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ACGTCATCCC ACAATTCTTC TGATGACAAG AATCGACGTA ATTCCAGTGA GGAGAGAAGT 360 

AGGCCAAACT CAGGTGATGA GAAGCGCAAA CTTAGCCTGT CTGAAGGGAA GGTGAAGAAG 420 

AACATGGGAG AAGGAAAGAA GAGGGTGTCT TCAGGCTCTT CAGAGAGAGG CTCCAAATCC 480 

CCTCTGAAAA GAGCCCAAGA GCAAAGTCCC CGGAAGCGGG GTCGGCCCCC AAAGGATGAG 540 

AAGGATCTCA CCATCCCGGA GTCTAGTACC GTGAAGGGGA TGATGGCCGG ACCGATGGCC 600 

GCGTTTAAAT GGCAGCCAAC CGCAAGCGAG CCTGTTAAAG ATGCAGATCC TCATTTCCAT 660 

CATTTCCTGC TAAGCCAAAC AGAGAAGCCA GCTGTCTGTT ACCAGGCAAT CACGAAGAAG 720 

TTGAAAATAT GTGAAGAGGA AACTGGCTCC ACCTCCATCC AGGCAGCTGA CAGCACAGCC 780 

GTGAATGGCA GCATCACACC CACAGACAAA AAGATAGGAT TTTTGGGCCT TGGTCTCATG 840 

GGAAGTGGAA TCGTCTCCAA CTTGCTAAAA ATGGGTCACA CAGTGACTGT CTGGAACCGC 900 

ACTGCAGAGA AAGAGGGGGC CCGTCTGGGA AGAACCCCCG CTGAAGTCGT CTCAACCTGC 960 

GACATCACTT TCGCCTGCGT GTCGGATCCC AAGGCGGCCA AGGACCTGGT GCTGGGCCCC 1020 

AGTGGTGTGC TGCAAGGGAT CCGCCCTGGG AAGTGCTACG TGGACATGTC AACAGTGGAC 1080 

GCTGACACCG TCACTGAGCT GGCCCAGGTG ATTGTGTCCA GGGGGGGGCG CTTTCTGGAA 1140 

GCCCCCGTCT CAGGGAATCA GCAGCTGTCT AATGACGGGA TGTTGGTGAT CTTAGCGGCT 1200 

GGAGACAGGG GCTTATATGA GGACTGCAGC AGCTGCTTCC AGGCGATGGG GAAGACCTCC 1260 

TTCTTCCTAG GTGAAGTGGG CAATGCAGCC AAGATGATGC TGATCGTGAA CATGGTCCAA 1320 

GGGAGCTTCA TGGCCACTAT TGCCGAGGGG CTGACCCTGG CCCAGGTGAC AGGCCAGTCC 1380 

CAGCAGACAC TCTTGGACAT CCTCAATCAG GGACAGTTGG CCAGCATCTT CCTGGACCAG 1440 

AAGTGCCAAA ATATCCTGCA AGGAAACTTT AAGCCTGATT TCTACCTGAA ATACATTCAG 1500 

AAGGATCTCC GCTTAGCCAT TGCGCTGGGT GATGCGGTCA ACCATCCGAC TCCCATGGCA 1560 

GCTGCAGCAA ATGAGGTGTA CAAAAGAGCC AAGGCGCTGG ACCAGTCTGA CAACGATATG 1620 

TCCGCCGTGT ACCGAGCCTA CATACACTAA GCTGTCGACA CCCCGCCCTC ACCCCTCCAA 1680 

TCCCCCCTCT GACCCCCTCT TCCTCACATG GGGTCGGGGG CCTGGGAGTT CATTCTGGAC 1740 

CAGCCCACCT ATCTCCATTT CCTTTTATAC AGACTTTGAG ACTTGCCATC AGCACAGCAC 1800 

ACAGCAGCAC CCTTCCCCTG AGGCCGGTGG GGAGGGGACA AGTGTCAGCA GGATTGGCGT 1860 

GTGGGAAAGC TCTTGAGCTG GGCACTGGCC CCCCGGACGA GGTGGCTGTG TGTTCACACA 1920 

CACACACACA CACACACACA CACAGGCTCT CGCCCCAGGA TAGAAGCTGC CCAGAAACTG 1980 
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CTGCCTGGCT TTTTTTCTTC CGAGCTTGTC TTATCTCAAA CCCCTTCCAG TCAAGGAACT 2040 

AGAATCAGCA ACGAGAGTTG GAAGCCTTCC CACAGCTTCC CCCAGAGCGA AGAGGCTGTA 2100 

GTCATGTCCC CATCCCCCAC TGGATTCCCT ACAAGGAGAG GCCTTGGGCC CAGATGAGCC 2160 

AGTACAGACT CCAGACAGAG GGGCCCTTGG GGCCCTCCAA CCTCAGGTGA TGAGCTGAGA 2220 

AAGATGTTCA CGTCTAAGCG TCCAGTGTGC ACCCAGCGCT CCATAGACGC CTTTGTGAAC 2280 

TGAAAAGAGA CTGGCAGAGT CCCGAGAAGA TGGGGCCCTG GCTTTCCAGG GAGTGCAGCA 2340 

AGCAGCCGGC CTGCAGACCC AGCCTGACCA ACGATGAGCA TTTCTTAGGC TCAGCTCTTG 2400 

ATACGGAAAC GAGTGTCTTC ACTCCAGCCA GCATCATGGT CTTCGGTGCT TCCCGGGCCC 2460 

GGGGTCTGTC GGGAGGGAAG AGAACTGGGC CTGACCTACC TGAACTGACT GGCCCTCCGA 2520 

GGTGGGTCTG GGACATCCTA GAGGCCCTAC ATTTGTCCTT GGATAGGGGA CCGGGGGGGG 2580 

CTTGGAATGT TGCAAAAAAA AAAGTTACCC AAGGGATGTC AGTTTTTTAT CCCTCTGCAT 2640 

GGGTTGGATT TTCCAAAATC ATAATTTGCA GAAGGAAGGC CAGCATTTAC GATGCAATAT 2700 

GTAATTATAT ATAGGGTGGC CACACTAGGG CGGGGTCCTT CCCCCCTCAC AGCTTTGGCC 2760 

CCTTTCAGAG ATTAGAAACT GGGTTAGAGG ATTGCAGAAG ACGAGTGGGG GGAGGGCAGG 2820 

GAAGATGCCT GTCGGGTTTT TAGCACAGTT CATTTCACTG GGATTTTGAA GCATTTCTGT 2880 

CTGGACACAA AGCCTGTTCT AGTCCTGGCG GAACACACTG GGGGTGGGGG CGGGGGAAGA 2940 

TGCGGTAATG AAACCGGTTA GTCAATTTTG TCTTAATATT GTTGACAATT CTGTAAAGTT 3000 

CCTTTTTATG AATATTTCTG TTTAAGCTAT TTCACCTTTC TTTTGAAATC CTTCCCTTTT 3060 

AAGGAGAAAA TGTGACACTT GTGAAAAAGC TTGTAAGAAA GCCCCTCCCT TTTTTCTTTA 3120 

AACCTTTAAA TGACAAATCT AGGTAATTAA GGTTGTGAAT TTTTATTTTT GCTTTGTTTT 3180 

TAATGAACAT TTGTCTTTCA GAATAGGATT GTGTGATAAT GTTTAAATGG CAAAAACAAA 3240 

ACATGATTTT GTGCAATTAA CAAAGCTACT GCAAGAAAAA TAAAACACTT CTTGGTAACA 3300 

CAAAAAAAAA AAAAAA 3316 
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 547 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Met Ala Ala Val Ser Leu Arg Leu Gly Asp Leu Val Trp Gly Lys Leu 
15 10 15 

Gly Arg Tyr Pro Pro Trp Pro Gly Lys lie Val Asn Pro Pro Lys Asp 

20 25 30 

Leu Lys Lys Pro Arg Gly Lys Lys Cys Phe Phe Val Lys Phe Phe Gly 
35 40 45 

Thr Glu Asp His Ala Trp lie Lys Val Glu Gin Leu Lys Pro Tyr His 
50 55 60 

Ala His Lys Glu Glu Met lie Lys He Asn Lys Gly Lys Arg Phe Gin 
65 70 75 80 

Gin Ala Val Asp Ala Val Glu Glu Phe Leu Arg Arg Ala Lys Gly Lys 

85 90 95 

Asp Gin Thr Ser Ser His Asn Ser Ser Asp Asp Lys Asn Arg Arg Asn 

100 105 110 

Ser Ser Glu Glu Arg Ser Arg Pro Asn Ser Gly Asp Glu Lys Arg Lys 
115 120 125 

Leu Ser Leu Ser Glu Gly Lys Val Lys Lys Asn Met Gly Glu Gly Lys 
130 135 140 

Lys Arg Val Ser Ser Gly Ser Ser Glu Arg Gly Ser Lys Ser Pro Leu 
145 150 155 160 

Lys Arg Ala Gin Glu Gin Ser Pro Arg Lys Arg Gly Arg Pro Pro Lys 

165 170 175 

Asp Glu Lys Asp Leu Thr He Pro Glu Ser Ser Thr Val Lys Gly Met 

180 185 190 

Met Ala Gly Pro Met Ala Ala Phe Lys Trp Gin Pro Thr Ala Ser Glu 
195 200 205 

Pro Val Lys Asp Ala Asp Pro His Phe His His Phe Leu Leu Ser Gin 
210 215 220 

Thr Glu Lys Pro Ala Val Cys Tyr Gin Ala He Thr Lys Lys Leu Lys 
225 230 235 240 

He Cys Glu Glu Glu Thr Gly Ser Thr Ser He Gin Ala Ala Asp Ser 

245 250 255 

Thr Ala Val Asn Gly Ser He Thr Pro Thr Asp Lys Lys He Gly Phe 

260 265 270 
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Leu Gly Leu Gly Leu Met Gly Ser Gly lie Val Ser Asn Leu Leu Lys 
275 280 285 

Met Gly His Thr Val Thr Val Trp Asn Arg Thr Ala Glu Lys Glu Gly 
290 295 300 

Ala Arg Leu. Gly Arg Thr Pro Ala Glu Val Val Ser Thr Cys Asp lie 
305 310 315 320 

Thr Phe Ala Cys Val Ser Asp Pro Lys Ala Ala Lys Asp Leu Val Leu 

325 330 335 

Gly Pro Ser Gly Val Leu Gin Gly lie Arg Pro Gly Lys Cys Tyr Val 

340 345 350 

Asp Met Ser Thr Val Asp Ala Asp Thr Val Thr Glu Leu Ala Gin Val 
355 360 365 

lie Val Ser Arg Gly Gly Arg Phe Leu Glu Ala Pro Val Ser Gly Asn 
370 375 380 

Gin Gin Leu Ser Asn Asp Gly Met Leu Val lie Leu Ala Ala Gly Asp 
385 390 395 400 

Arg Gly Leu Tyr Glu Asp Cys Ser Ser Cys Phe Gin Ala Met Gly Lys 

405 410 415 

Thr Ser Phe Phe Leu Gly Glu Val Gly Asn Ala Ala Lys Met Met Leu 

420 425 430 

lie Val Asn Met Val Gin Gly Ser Phe Met Ala Thr He Ala Glu Gly 
435 440 445 

Leu Thr Leu Ala Gin Val Thr Gly Gin Ser Gin Gin Thr Leu Leu Asp 
450 455 460 

He Leu Asn Gin Gly Gin Leu Ala Ser lie Phe Leu Asp Gin Lys Cys 
465 470 475 480 

Gin Asn He Leu Gin Gly Asn Phe Lys Pro Asp Phe Tyr Leu Lys Tyr 

485 490 495 

He Gin Lys Asp Leu Arg Leu Ala He Ala Leu Gly Asp Ala Val Asn 

500 505 510 

His Pro Thr Pro Met Ala Ala Ala Ala Asn Glu Val Tyr Lys Arg Ala 
515 520 525 

Lys Ala Leu Asp Gin Ser Asp Asn Asp Met Ser Ala Val Tyr Arg Ala 
530 535 540 

Tyr He His 
545 

(2) INFORMATION FOR SEQ ID NO: 6: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

ATTGGTTCAA GAAGAAAATT TGAGCCCAGG CACCCAAACA CCTTCAAATG ATAAAGCAAG 60 

TATGTTGCAA GAATACTCCA AATACCTCCA ACAGGCTTTT GAAAAATCCA CTAATGCAAG 120 

TTTTACTCTT GGACACGGTT TCCAATTTGT CAGTTTGTCT TCACCTCTCC ACAACCACAC 180 

TTTGTTTCCA GAAAAACAAA TATACACTAC GTCTCCTTTG GAGTGTGGTT TCGGCCAATC 240 

TGTTACCTCA GTGTTGCCAT CTTCATTGCC AAAGCCTCCT TTTGGGATGT TGTTTGGATC 300 

TCAGCCAGGT CTTTATTTGT CTGCTTTGGA TGCTACACAT CAGCAGTTGA CACCTTCCCA 360 

GGAGCTGGAT GATCTGATAG ATTCTCAGAA GAACTTAGAG ACTTCATCAG CCTTCCAGTC 420 

CTCATCTCAG AAATTGACTA GCCAGAAGGA ACAGAAAAAC TTAGAGTCTT CAACAGGCTT 480 

TCAGATTCCA TCTCAGGAGT TAGCTAGCCA GATAGATCCT CAGAAAGACA TAGAGCCTAG 540 

AACAACGTAT CAGATTGAGA ACTTTGCACA AGCGTTTGGT TCTCAGTTTA AGTCGGGCAG 600 

CAGGGTGCCA ATGACCTTTA TCACTAACTC TAATGGAGAA GTGGACCATA GAGTAAGGAC 660 

TTCAGTGTCA GATTTCTCAG GGTATACAAA TATGATGTCT GATGTAAGTG AGCCATGTAG 720 

TACAAGAGTA AAGACACCCA CCAGCCAGAG TTACAGGTAA GGTCCCAAAA GTGGCCAGGC 780 

TGGAGGTTTT TTAATGTAAT TTTGTTTTAT TTTGAGAACA CTGCCATTGG AATGTTTTTA 840 

CACGATCCTA TTAAGAATAA TGTGATGCCC TTTCAATGCA ACTTTTCATA TTTAGTTTAT 900 

TTTGTTAGCG TGATTTTAGC TCTGTTTGTA TTATGATTTT TAATCAAAAT CAATAGATTA 960 

AAAATAGTTT GACATTCAAA GTGACAATGT TTAGCAATCA AATTTACATG TATAGATTGT 1020 

CAGGGAATAG CCCAAATGTT TTAAACGCAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 1080 

AAAAAAAAAA AAAAAAA 1097 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 232 amino acids 
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(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Met Leu Gin Glu Tyr Ser Lys Tyr Leu Gin Gin Ala Phe Glu Lys Ser 
15 10 15 

Thr Asn Ala Ser Phe Thr Leu Gly His Gly Phe Gin Phe Val Ser Leu 

20 25 30 

Ser Ser Pro Leu His Asn His Thr Leu Phe Pro Glu Lys Gin lie Tyr 
35 40 45 

Thr Thr Ser Pro Leu Glu Cys Gly Phe Gly Gin Ser Val Thr Ser Val 
50 55 60 

Leu Pro Ser Ser Leu Pro Lys Pro Pro Phe Gly Met Leu Phe Gly Ser 
65 70 75 80 

Gin Pro Gly Leu Tyr Leu Ser Ala Leu Asp Ala Thr His Gin Gin Leu 

85 90 95 

Thr Pro Ser Gin Glu Leu Asp Asp Leu lie Asp Ser Gin Lys Asn Leu 

100 105 110 

Glu Thr Ser Ser Ala Phe Gin Ser Ser Ser Gin Lys Leu Thr Ser Gin 
115 120 125 

Lys Glu Gin Lys Asn Leu Glu Ser Ser Thr Gly Phe Gin lie Pro Ser 
130 135 140 

Gin Glu Leu Ala Ser Gin lie Asp Pro Gin Lys Asp lie Glu Pro Arg 
145 150 155 160 

Thr Thr Tyr Gin lie Glu Asn Phe Ala Gin Ala Phe Gly Ser Gin Phe 

165 170 175 

Lys Ser Gly Ser Arg Val Pro Met Thr Phe lie Thr Asn Ser Asn Gly 

180 185 190 

Glu Val Asp His Arg Val Arg Thr Ser Val Ser Asp Phe Ser Gly Tyr 
195 200 205 

Thr Asn Met Met Ser Asp Val Ser Glu Pro Cys Ser Thr Arg Val Lys 
210 215 220 

Thr Pro Thr Ser Gin Ser Tyr Arg 
225 230 
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(2) INFORMATION FOR , SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

GTGTCACATA CCACTCTTGT AGGTGTCCTC AATAATCCCC TTTTCCCACA AAATACACAG 60 

GGTGTATTAT CTTTCTCTTT ATTCACCCCC ACTTTGCTGA ACTGAAGTTA ATTACATAGC 120 

CTTTCTTCTA ACCTCCTTAG TAATGAACCT TCACATAAAG TGTATTTACA GCGTCTGTGG 180 

TAGCCAGCCC TTCCTCCTCT ACTTTCTAGG AGGGGATAGC CAATAACTAG GAATTTAATG 240 

ACAGATTTTT TTTTTCTTTG AAATAAATGG CCAGAGTTTC TCCATTTTAG AATTTTGTTG 300 

TCCTCCTTAA TCATCTGCTT ACCTAGTCAT TACTCAATCT GCAGAAACTT CATAAAGGAA 360 

AAGTGCTGCA TTGTTTTTAC AAATAACAGT TTGTAGGGAA AATATGACAA ACCTCAACTA 420 

TGGGAGTTGT CCACAATACA AAATTTTGAA AAAACATTAC ATAGTGATAA TATCATACTT 480 

GGTTGTTAGG CTTGTTGCTT CCCCACATCA GAGGCATCTA ATGATTTATC TTTTGTAATT 540 

GCTGTGAACT TTTTTAAATA AGCCATTTAG TGTGAAATTG TCATGTATCA AATGGCTATT 600 

GGAAATGGAC TTTACTCAAT TTTAATTCCA CTGCACTCTA GCCGGAGTGA CAGAGTAAGA 660 

CTCTGTCTCA AAAATAAATA AATAAATAAA TAAATAAATA AATAAATAAA TAAATAAAAA 720 

ATAATAATAC AAGTTTTCAT AAGGAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAA 775 
(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
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Met Thr Asn Leu Asn Tyx Gly Ser Cys Pro Gin Tyr Lys lie Leu Lys 
15 10 15 

Lys His Tyr lie Val lie lie Ser Tyr Leu Val Val Arg Leu Val Ala 

20 25 30 

Ser Pro His Gin Arg His Leu Met lie Tyr Leu Leu 
35 40 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAAAAAGAAG ACAAAGCTCA CCTTCAGGCG GAGGTGCAGC ACCTGCGAGA GGACAACCTG 60 

AGGCTACAGG AGGAGTCCCA GAACGCCTCG GACAAGCTGA AGAAGTTCAC AGAATGGGTC 120 

TTCAACACCA TAGACATGAG CTAGGGAAGG CTGAGGAGGA CAGGAGAAGG GCCCAGACAC 180 

TCCCTCCAGT GAGTGTCCTG CAGCCCTTAT TCCCTCCATA GAAAGCATCC TCAGAGCACC 240 

TTCCCTGGCT TCCTACTCTG CCCCCTTTCG GGGAGTGCAC AACACAATAG TTGCAGATCA 300 

ACAATCATCA CCTGCCTTTT GTAGAAAAGA AAAACAAAAA AAGTAAATAA AAATTTTAAA 360 

CAGTAAAATA AAAGTTTAAC TGCTAAAATG TGAATGTCTT TATTTTTTTG CACAATATCT 420 

TTATCTGTTA TGTATTTAAG AAGAAACTGG GCCTTGGACC AGGGCGCCCC CTGGCCCATC 480 

CGCCTCTATT CCCATCAGCT TTCTTATCAA CTTCAGGTAA CCCAAGCTTT CCCTTGTTAT 540 

TCTAACAAAT ATCATTATTC CTAGAAAAAG AATGTTTTTA TAACTTGTTT GGGGAGTAGA 600 

GAGGGATATT TCCTTACCTT CTTCCCTAAA ATGCCTGGAG AGGGAGTTGC TTTGAGAAAA 660 

TGCCTACCTC CCTTGAATGA CTCGTGCATG AGCTAGTGCT GTCTGTACCT GTCCTCCAGA 720 

GATCAGCAGG ACCGGAGTTA AATATTTAAC AGCAAGTCTG TAAACCAGAG CAGCTCTGAC 780 

AGTGCCTGCA GGCCACACCC CTTCTCAGTC CTGCATTGTG AGGTCATTTC CTGCTTCTCC 840 

CTTTCCCCAG GAAGATGGTC CCACTTGTGC TGCAAGACTC TTTTTTGTTT GGCTTAATTG 900 

AGCCCCACTA AATTGGAATC AATCTCTCTT ACAGCTTCCT GGCTCCAAAC ATTAATTGAT 960 
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TTCAGAATTC CCCCAAACTA AAACCTTATC 
AGTATACGTT TTAAAGATTT TTAAAGATAA 
CTGAAAATGA CCATATCTCT AAATTAATCT 
TGTAACTAGG AGAAGCTAAG TGAATGTTTA 
ATAATGACTG TGATACTAGA ATGGGCTTTT 
GCACGGCATT TTCTGCATCC TTTCATGGCC 
ATGTGCTATT TTTGCTGTCT TGTGATGCAG 
AGGCTGCTGT AGAGCACAGA GACATGGGGC 
CTTTGTTTGT TGCCTTCATA ACTGTCACTA 
TCTGGTCTAA GGGAACATTC AGCACTCTAG 
GTAATCTCAA TTCCTTCCTC TCTCCATCCC 
CTCCCTGCTA CATTCACTAC CAGATTTTTA 
TCCATGGAAT TTTTTTTTTC TGGTGACATT 
GCTTTGTGAA GATTTCAATT GTCTGTCTCT 
TACATTGCCT GATATCTCTT TGTAAATGAG 
TCTTGCTTAT CCTTCTGAGT TTAGTTCTCA 
GGCAAGCTAT TTATTAGAGT TTTGCAACAG 
TGTAAAATTC AAAGAATAAA ATTCATTTTA 
AAAAAAAAAA AAAAAAAAAA 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 62 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE : protein 
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TTGAATGCAT TTTGGTCAAA 1020 

CAACTGGTTT TTTTAGCTTG 1080 

AGCAAGACTT CACCAGTATT 1140 

TTAATCATTG CTTGTTAGGA 1200 

TGTCCCAGTG TGAAATTTCA 1260 

TTCCGCTGCA GAAATTATTG 1320 

GCCCCTGGGT CACTCTTCCA 1380 

TGACTGACCT GAGGAGACCC 1440 

TCTGCCCTTT CAGTGGCAAC 1500 

TTGGAAGTTC CCTCACCCAA 1560 

AGGATGGATT TTCCTCTCTT 1620 

TCATTCTTGA TTGTGATTTC 1680 

AAATAGGAAG ATTTCGGAGT 1740 

GACTTGTATG AAGGAGATTG 1800 

AACATCCAAG CATTCTGAAG 1860 

CATTTTGTTT GGGGACTTGG 1920 

TGAAGCCTCT AAAGACTACC 1980 

TAAAAAAAAA AAAAAAAAAA 2040 

2060 



TGTCTGCATT 
AAATGTGGCA 
TTCTCTCCAG 
ATTGTGAATT 
GAAACCTGCA 
ATCCAAAGGA 
GCTGCTTTGG 
TGGCCAGTGT 
AACCGACCCC 
CGGCATCTGA 
TGAAAGAAAC 
TGCTACAGTT 
TCTATCATGG 
TTCTCTCTTT 
AAATATTGCT 
TTTTGTTTTA 
AGTTCTTGTT 
AACGCTCTTT 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Met Thr Val lie Leu Glu Trp Ala Phe Glu Thr Cys Met Ser Gin Cys 
15 10 15 
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Glu He Ser Ala Arg His Phe Leu His Pro Phe Met Ala He Gin Arg 

20 25 30 

He Pro Leu Gin Lys Leu Leu Met Cys Tyr Phe Cys Cys Leu Val Met 
35 40 45 

Gin Ala Ala Leu Gly Pro Trp Val Thr Leu Pro Arg Leu Leu 
50 55 60 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1160 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

GATTATTTTC AGTAGGCAGA CATCTAATCG GAATCTTGCT CTTGTTGCCC AGGCTGGAGT 60 

GTAATGGCAC AATCTCGGCT TACTGCAACC TCTGCCTCCT GGATTCAAGT GATTCTCCTG 120 

CCTCAGCCTC CCAAGTAGCT GGGATTACAG CCCTGAAAAC CACTCGCTTG CAGAGCGCTG 180 

GATCAGCAAT GCCTACTAGT TCTTCATTCA AACACCGGAT TAAAGAGCAG GAAGACTACA 240 

TCCGAGATTG GACTGCTCAT CGAGAAGAGA TAGCCAGGAT CAGCCAAGAT CTTGCTCTCA 300 

TTGCTCGGGA GATCAACGAT GTAGCAGGAG AGATAGATTC AGTGACTTCA TCAGGCACTG 360 

CCCCTAGTAC CACAGTAAGC ACTGCTGCCA CCACCCCTGG CTCTGCCATA GACACTAGAG 420 

AAGAGTTGGT TGATCGTGTT TTTGATGAAA GCCTCAACTT CCAAAAGATT CCTCCATTAG 480 

TTCATTCCAA AACACCAGAA GGAAACAACG GTCGATCTGG TGATCCAAGA CCTCAAGCAG 540 

CAGAGCCTCC CGATCACTTA ACAATTACAA GGCGGAGAAC CTGGAGCAGG GATGAAGTCA 600 

TGGGAGATAA TCTGCTGCTG TCATCCGTCT TTCAGTTCTC TARGAAGATA AGACAATCTA 660 

TAGATAAGAC AGCTGGAAAG ATCAGAATAT TATTTAAAGA CAAAGATCGG AATTGGGATG 720 

ACATAGAAAG CAAATTAAGA GCCGAAAGTG AAGTCCCTAT TGTGAAAACC TCGAGCATGG 780 

AGATTTCTTC TATCTTACAG GAACTGAAAA GAGTAGAAAA GCAGCTACAA GCAATCAATG 840 

CTATGATTGA TCCTGATGGA ACTTTGGAGG CTCTGAACAA CATGGGATTT CCCAGTGCTA 900 

TGTTGCCATC TCCACCGAAA CAGAAGTCCA GCCCTGTGAA TAACCACCAC AGCCCGGGTC 960 
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AGACACCAAC ACTTGGCCAA CCAGAAGCTA GGGCTCTTCA TCCTGCTGCT GTTTCAGCCG 1020 

CAGCTGAATT TGAGAATGCT GAATCTGAGG CTGATTTCAG TATACATTTC AATAGAGTCA 1080 

ACCCTGATGG GGAAGAGGAA GATGTTACAG TAACATAAAT GACTTTCTCT TGATTGTTGA 1140 

AAAAAAAAAA AAAAAAAAAA 1160 
(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 309 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Met Pro Thr Ser Ser Ser Phe Lys His Arg lie Lys Glu Gin Glu Asp 
1 5 10 15 

Tyr lie Arg Asp Trp Thr Ala His Arg Glu Glu He Ala Arg He Ser 

20 25 30 

Gin Asp Leu Ala Leu He Ala Arg Glu He Asn Asp Val Ala Gly Glu 
35 40 45 

He Asp Ser Val Thr Ser Ser Gly Thr Ala Pro Ser Thr Thr Val Ser 
50 55 60 

Thr Ala Ala Thr Thr Pro Gly Ser Ala He Asp Thr Arg Glu Glu Leu 
65 70 75 80 

Val Asp Arg Val Phe Asp Glu Ser Leu Asn Phe Gin Lys He Pro Pro 

85 90 95 

Leu Val His Ser Lys Thr Pro Glu, Gly Asn Asn Gly Arg Ser Gly Asp 

100 105 110 

Pro Arg Pro Gin Ala Ala Glu Pro Pro Asp His Leu Thr He Thr Arg 
115 120 125 

Arg Arg Thr Trp Ser Arg Asp Glu Val Met Gly Asp Asn Leu Leu Leu 
130 135 140 

Ser Ser Val Phe Gin Phe Ser Xaa Lys He Arg Gin Ser He Asp Lys 
145 150 155 160 

Thr Ala Gly Lys He Arg He Leu Phe Lys Asp Lys Asp Arg Asn Trp 

165 170 175 
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Asp Asp lie Glu Ser Lys Leu Arg Ala Glu Ser Glu Val Pro lie Val 

180 185 190 

Lys Thr Ser Ser Met Glu lie Ser Ser lie Leu Gin Glu Leu Lys Arg 
195 200 205 

Val Glu Lys Gin Leu Gin Ala lie Asn Ala Met lie Asp Pro Asp Gly 
210 215 220 

Thr Leu Glu Ala Leu Asn Asn Met Gly Phe Pro Ser Ala Met Leu Pro 
225 230 235 240 

Ser Pro Pro Lys Gin Lys Ser Ser Pro Val Asn Asn His His Ser Pro 

245 250 255 

Gly Gin Thr Pro Thr Leu Gly Gin Pro Glu Ala Arg Ala Leu His Pro 

260 265 270 

Ala Ala Val Ser Ala Ala Ala Glu Phe Glu Asn Ala Glu Ser Glu Ala 
275 280 285 

Asp Phe Ser lie His Phe Asn Arg Val Asn Pro Asp Gly Glu Glu Glu 
290 295 300 

Asp Val Thr Val Thr 
305 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1536 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

GAAGAGAGAA AATCAGCCTG TCTGCTCTCT CCTTGGCTCA ACAAGGCCTC TAACAGTCTT 60 

CTGTCCTCTA TTCTGCACAC GGCATATTTG GGAACGAGAA ACAAAAGTTT TCCCAAATGA 120 

AGAGAACTCA CTTGTTTATT GTGGGGATTT ATTTTCTGTC CTCTTGCAGG GCAGAAGAGG 180 

GGCTTAATTT CCCCACATAT GATGGGAAGG ACCGAGTGGT AAGTCTTTCC GAGAAGAACT 240 

TCAAGCAGGT TTTAAAGAAA TATGACTTGC TTTGCCTCTA CTACCATGAG CCGGTGTCTT 300 

CAGATAAGGT CACGCANAAA CAGTTCCAAC TGAAAGAAAT CGTGCTTGAG CTTGTGGCCC 360 

ACGTCCTTGA ACATAAAGCT ATAGGCTTTG TGATGGTGGA TGCCAAGAAA GAAGCCAAGC 420 
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TTGCCAAGAA ACTGGGTTTT GATGAAGAAG GAAGCCTGTA TATTCTTAAG GGTGATCGCA 480 

CAATAGAGTT TGATGGCGAG TTTGCAGCTG ATGTCTTGGT GGAGTTCCTC TTGGATCTAA 540 

TTGAAGACCC AGTGGAGATC ATCAGCAGCA AACTGGAAGT CCAAGCCTTC GAACGCATTG 600 

AAGACTACAT CAAACTCATT GGCTTTTTCA AGAGTGAGGA CTCAGAATAC TACAAGGCTT 660 

TTGAAGAAGC AGCTGAACAC TTCCAGCCTT ACATCAAATT CTTTGCCACC TTTGACAAAG 720 

GGGTTGCAAA GAAATTATCT TTGAAGATGA ATGAGGTTGA CTTCTATGAG CCATTTATGG 780 

ATGAGCCCAT TGCCATCCCC AACAAACCTT ACACAGAAGA GGAGCTGGTG GAGTTTGTGA 840 

AGGAACACCA AAGGTGCCTG AGATGGCATG TGGGGGCTGG GGGCCTGGGG TCTGGGGAAT 900 

GGAGAGGAGC CTCTCTGTGC TAACATTTCA GACCTGCCAA GAGCAACAAC CTAGTTAGTA 960 

CCCCAGCAGT ACAGAACTCA GTAGTATGGC TTTGTTGATC AGTAATGACT AGCAGGGATG 1020 

TTATTACTTC TGAATCTAAG TCTGCACCTG CAAGCAGAGT TTGATAAATC CCTCAGTCAG 1080 

CAAATCCCCT CAAAGCCAGG GCAAGATATA AATAAAATTC TATACTAGGA ATGAGAGCAA 1140 

TTTAGTGAAA GTTCCCATAT ACCAATAACC ATGCCCAGTG CTTTAGGGAA ACTATTTTAT 1200 

CTAATCTCCA ACCTTAGGGA GTAATTATTA TTATCCCAAT TTTACAGATC AAGGAATTGG 1260 

ACTCAATAGT TAAGTAACTT AGCCAAGGAT GAACACTCTA TGCATAGAAC TTCTGGGAGA 1320 

GAAATGCTTG ATACCACTTA GTGTAGCTCC AGCATGGATC AGCAAACTTT TTCTGTAAAG 1380 

AACAAAATGG TAAATATTTC AGGTTCTGTG GGCCAGATGG CGTCTGTAGC AACTACTTAA 1440 

CTGCGGCTGT GGCATGAAAG CAGCCATGGA TCATGTATAA ACAAATGGGT GTGGCTGTGT 1500 

ACCAGTAAAA GTTTATTTAG GAAAAAAAAA AAAAAA 1536 
(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 268 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Met Lys Arg Thr His Leu Phe lie Val Gly lie Tyr Phe Leu Ser Ser 
15 10 15 
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Cys Arg Ala Glu Glu Gly Leu Asn Phe Pro Thr Tyr Asp Gly Lys Asp 

20 25 30 

Arg Val Val Ser Leu Ser Glu Lys Asn Phe Lys Gin Val Leu Lys Lys 
35 40 45 

Tyr Asp Leu Leu Cys Leu Tyr Tyr His Glu Pro Val Ser Ser Asp Lys 
50 55 60 

Val Thr Xaa Lys Gin Phe Gin Leu Lys Glu lie Val Leu Glu Leu Val 
65 70 75 80 

Ala His Val Leu Glu His Lys Ala lie Gly Phe Val Met Val Asp Ala 

85 90 95 

Lys Lys Glu Ala Lys Leu Ala Lys Lys Leu Gly Phe Asp Glu Glu Gly 

100 105 110 

Ser Leu Tyr lie Leu Lys Gly Asp Arg Thr He Glu Phe Asp Gly Glu 
115 120 125 

Phe Ala Ala Asp Val Leu Val Glu Phe Leu Leu Asp Leu He Glu Asp 
130 135 140 

Pro Val Glu He He Ser Ser Lys Leu Glu Val Gin Ala Phe Glu Arg 
145 150 155 160 

He Glu Asp Tyr He Lys Leu He Gly Phe Phe Lys Ser Glu Asp Ser 

165 170 175 

Glu Tyr Tyr Lys Ala Phe Glu Glu Ala Ala Glu His Phe Gin Pro Tyr 

180 185 190 

He Lys Phe Phe Ala Thr Phe Asp Lys Gly Val Ala Lys Lys Leu Ser 
195 200 205 

Leu Lys Met Asn Glu Val Asp Phe Tyr Glu Pro Phe Met Asp Glu Pro 
210 215 220 

He Ala He Pro Asn Lys Pro Tyr Thr Glu Glu Glu Leu Val Glu Phe 
225 230 235 240 

Val Lys Glu His Gin Arg Cys Leu Arg Trp His Val Gly Ala Gly Gly 

245 250 255 

Leu Gly Ser Gly Glu Trp Arg Gly Ala Ser Leu Cys 

260 265 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1009 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(ii) MOLECULE 



TYPE: 



CDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

GAATTCGGCC TTCATGCGCC TGCAGGAAAG AATCTGACAT CATCACACTG TGTTTTCCTT 60 

AACITGACAG GAAGTCAACT TCAAGCAGAT TGACTTGAAA CGGGATCTCA TTTAGGAAGC 120 

ATAAGTGTCC AATCAAAAAC TGTGTATTTT TTTAAATTTG GAAAATACTC AAGTTCCAGT 180 

TGCTTATCAT TCTCCTTCAC TTTCTGAAAA CCTGGCAATC CCATGTGGAC TTCTGGTAGA 240 

ATGAGCAATG CAAAGAACTG GCTTGGACTT GGCATGTCCT TGTACTTCTG GGGGCTGATG 300 

GACCTTACGA CCACCGTTCT CTCGGACACC CCAACACCAC AAGGTGAATT AGAAGCACTC 360 

CTGTCAGACA AGCCACAGTC ACATCAGCGG ACCAAGARGA GCTGGGTTTG GAACCAGTTT 420 

TTCGTTCTGG AAGAGTACAC TGGGACCGAC CCTTTGTATG TCGGCAAGGT AAGAAATGCC 480 

AAGTAGAAAT GACCCGGGTA GTGGATATTG AAATTGAATA TGAATTGAGT ATCAAAGTTG 540 

ACCTAGCCTT TATYTGAGAC CTGAGAAAAA CTAGAACAAG TGGTACGTTA CTTGACACCT 600 

AGCTAAAATG TAACTTCTGC TTTGTCAGAG ACCAGTCTGA AAGGAAAGAT TTATTTCCTT 660 

GTCCATGTCT CTGGTATGAA TGGGAAAAAG TGGGAATTGG GATTTGGAGG AAAAGGCTCA 720 

GACCCTGCAA GAGCTATTCA AGTCCTAAAA GAGGCAGCAG CAGCTGTCTG GGAATGACAG 780 

AATGGGGGAG AGGGAAACTT GGAAATACAA GAAGAGTACA GAGTTTTTTG CTTTGTGTTT 840 

TTATGGGGTT TTTTTCAGAG CATTTCCAGA GGTATTGCCT GAGGTGCAAA TTTAATGAAA 900 

AAAATAAAAA TAAAACATTT TTCATTTCAG AAGATCTTAG CATGTGCTTT AGGATAGTTG 960 

GAGACAATAA ATATATTTAT AAATGTTAAA AAAAAAAAAA AAAAAAAAA 1009 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 87 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
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Met Trp Thr Ser Gly Arg Met Ser Asn Ala Lys Asn Trp Leu Gly Leu 
15 10 15 

Gly Met Ser Leu Tyr Phe Trp Gly Leu Met Asp Leu Thr Thr Thr Val 

20 25 30 

Leu Ser Asp Thr Pro Thr Pro Gin Gly Glu Leu Glu Ala Leu Leu Ser 
35 40 45 

Asp Lys Pro Gin Ser His Gin Arg Thr Lys Xaa Ser Trp Val Trp Asn 
50 55 60 

Gin Phe Phe Val Leu Glu Glu Tyr Thr Gly Thr Asp Pro Leu Tyr Val 
65 70 75 80 

Gly Lys Val Arg Asn Ala Lys 

85 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2546 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

* 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AAAAGAAACC AAGGAAATTT GTATGATAAG GCAGGTAAAG TGAGGAAACA TGCAACTGAA 
CAGGAAAAAA CTGAAGAGGG ATTAGGCCCT AATATAAAAA GCATTGTCAC CATGTTGATG 
CTGATGCTAT TGATGATGTT TGCTGTCCAC TGTACCTGGG TCACAAGCAA TGCCTACTCT 
AGTCCAAGTG TAGTCCTGGC CTCATACAAT CATGATGGCA CCAGGAATAT CTTAGATGAT 
TTTAGAGAAG CTTACTTTTG GCTAAGGCAA AATACAGATG AACATGCACG AGTAATGTCT 
TGGTGGGATT ATGGCTATCA GATAGCTGGA ATGGCTAATA GAACTACGTT GGTGGATAAT 
AACACCTGGA ATAACAGCCA CATAGCACTG GTGGGAAAAG CTATGTCTTC TAATGAAACA 
GCAGCCTATA AAATCATGAG GACTCTAGAT GTAGATTATG TTTTGGTTAT TTTTGGAGGG 
GTTATTGGCT ATTCTGGTGA TGATATCAAC AAATTTCTCT GGATGGTTAG GATAGCTGAA 
GGAGAACATC CCAAAGACAT TCGGGAAAGT GACTATTTTA CCCCACAGGG AGAATTCCGT 
GTAGACAAAG CAGGATCCCC TACTTTGTTG AATTGCCTTA TGTATAAAAT GTCATACTAC 
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AGATTTGGAG AAATGCAGCT GGATTTTCGT 
GCTGAGATTG GAAATAAGGA CATTAAATTC 
CACTGGCTTG TTAGGATATA TAAAGTAAAA 
AAACCTCGAG TCACCAACAT TTTCCCAAAA 
AGGAAGCGTG GCTACATTAA AAATAAGCTG 
AAGACTGTTT AAATGCACTG TTCTGGTTCC 
GGTCTTTGCC TTTAGCTCAT GTCGTGTTTC 
TCCAGGTTAA TGTACAAAAT TTTCTGGCAA 
TGAGAACAGC TGTTTTCGAT TTCTAATGTG 
AGCAGCAGAT TTTTTTTTTA TTGGTACATA 
CTGCACCAAA GAACCCTCTA ATTTGGTCCC 
TCTCTTACAA GACCTGCATT TTATTTGAAT 
GACAAAATGT GATGAGAGCT TCTTGAACCG 
TTAGAAGAAT CATTATACTT CCCTGAATTA 
AATGTTTCAT CAAATGTATA TTTTCTGTTG 
TTTCAAAGAT GGACCTCTGA GTAGCTAATT 
TATTACTGCT GTTAGCAGTC GTTTTCACCA 
TCATTCAAGG GCTAAATTTA TATTTTTTGG 
ACCAGGACGG AGTTTTGGTA TCTTAGTACT 
TTAAGGAAAA AACCCTTAAA GTGGACAGGT 
AGTGACAAAA GACTTGTAAC AACTTTGCCT 
ATTCACAATG ATTTTGTTCT CTGCTCCATA 
ACTCTTTAGT GCTATATGCA TTTTCTTACT 
TGCACTAAAA TGTAAATGGA GATTGAACAA 
ACAGACTTGA ACATTTTCTC CAGTTGTTTA 
CCACAGGATA TAACTTTTTT TTATATAACA 
AGTGCAGAAT AGTATGTACC TTTTATGAAG 
TGTTACTGCT GATTTTCTTT TCCAAGGTGT 
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ACACCCCCAG GTTTTGACCG AACACGTAAT 720 

AAACATTTGG AAGAAGCCTT TACATCAGAA 780 

GCACCTGATA ACAGGGAGAC ATTAGATCAC 840 

CAGAAGTATT TGTCAAAGAA GACTACCAAA 900 

GTTTTTAAGA AAGGCAAGAA AATATCTAAG 960 

TAACTTGAAG CAGTTGTCCT TGTGAGAACC 1020 

ACAGCAAAGA GGGTACAGAA CCATCACTGG 1080 

TGCCTGATTA AAAAAATAAA ATTGGCTTGT 1140 

AAGCAAGACA GAGCACTGCT GTAAATGTCT 1200 

TTATCCTTCA AATCTGAGAA TTTGGACTAA 1260 

TGGCACATGC ATACTTGTCA ATGTTTTTAT 1320 

TACCCGAATA GCAATATGTA AAATACAAGT 1380 

GTAAACTAGT ACAGGTCTGA GAAAGACATA 1440 

TATTTATTTT CATGTTTCTC TAATGCAAAG 1500 

CTTACTGTTT GCTCTGAGAA GAAGCTGCTG 1560 

GATTCAAGTA GTTTTTTTAT GTTGACACAT 1620 

GGTACTTACA GAGCAGATTT CATACATCAT 1680 

AAATCATGGC AACTACACAG GATGTTGCTT 1740 

GAAGTTAGCA CTATGTTTAC ATGCAAAAGA 1800 

ATCCAAAGTT CATTTTCTGT GACTCATCAA 1860 

GGACTTTTTT CATTTTACAA CAGTTCATCC 1920 

TTTTTTAATC CCTTAAGCAT TTGATGAAAC 1980 

TTTGTTAAAA ATGTGACAAT TGTCAAAAAA 2040 

GTTCACTTTC CAGCTTATAG GCAACTTTAT 2100 

GTAAAAGTGA AAGAGAAAGG GTTTTTCCTG 2160 

AGCATAACAC ACCACTGCTT TTGGTGGAAA 2220 

AAAAATGTAA TTTACAATAT TCAGTGAGAA 2280 

AGAATATTCT TTGATTTATA GAATTCATTT 2340 
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TTGACCCAGA TGATGGTTCC TTTACAGAAC AATAAAATGG CTGAACATTT TCACAAATAG 2400 
AGTGTAACGA AGTCTGGATT TCTGATACCT TGTCATTTGG GGGATTTTAT TTTACTTTGT 2460 
TGCTTTAAAA TTCAATGCAG AGAAGTTGTT GACTGTAGGG GAAATAAAGT TAATTCAAAT 2520 



TTTGAAAAAA AAAAAAAAAA AAAAAA 



2546 



(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 286 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Leu Met Leu Met Leu Leu Met Met Phe Ala Val His Cys Thr Trp 
15 10 15 

Val Thr Ser Asn Ala Tyr Ser Ser Pro Ser Val Val Leu Ala Ser Tyr 

20 25 30 

Asn His Asp Gly Thr Arg Asn lie Leu Asp Asp Phe Arg Glu Ala Tyr 
35 40 45 

Phe Trp Leu Arg Gin Asn Thr Asp Glu His Ala Arg Val Met Ser Trp 
50 55 60 

Trp Asp Tyr Gly Tyr Gin lie Ala Gly Met Ala Asn Arg Thr Thr Leu 
65 70 75 . 80 

Val Asp Asn Asn Thr Trp Asn Asn Ser His lie Ala Leu Val Gly Lys 

85 90 95 

Ala Met Ser Ser Asn Glu Thr Ala Ala Tyr Lys lie Met Arg Thr Leu 

100 105 110 

Asp Val Asp Tyr Val Leu Val lie Phe Gly Gly Val lie Gly Tyr Ser 
115 120 125 

Gly Asp Asp lie Asn Lys Phe Leu Trp Met Val Arg lie Ala Glu Gly 
130 * 135 140 

Glu His Pro Lys Asp lie Arg Glu Ser Asp Tyr Phe Thr Pro Gin Gly 
145 150 155 160 

Glu Phe Arg Val Asp Lys Ala Gly Ser Pro Thr Leu Leu Asn Cys Leu 

165 170 175 
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Met Tyr Lys Met Ser Tyr Tyr Arg Phe Gly Glu Met Gin Leu Asp Phe 

180 185 190 

Arg Thr Pro Pro Gly Phe Asp Arg Thr Arg Asn Ala Glu lie Gly Asn 
195 200 205 

Lys Asp lie Lys Phe Lys His Leu Glu Glu Ala Phe Thr Ser Glu His 
210 215 220 

Trp Leu Val Arg lie Tyr Lys Val Lys Ala Pro Asp Asn Arg Glu Thr 
225 230 235 240 

Leu Asp His Lys Pro Arg Val Thr Asn lie Phe Pro Lys Gin Lys Tyr 

245 250 255 

Leu Ser Lys Lys Thr Thr Lys Arg Lys Arg Gly Tyr lie Lys Asn Lys 

260 265 270 

Leu Val Phe Lys Lys Gly Lys Lys lie Ser Lys Lys Thr Val 
275 280 285 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4061 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

AAGCAATTGA AGAAATTGCA GCAGGATGTG ATGGAAATGA AAAAAACAAA GGTTCGCCTA 60 

ATGAAACAAA TGAAAGAAGA ACAAGAGAAA GCCAGACTGA CTGAGTCTAG AAGAAACAGA 120 

GAGATTGCTC AGTTGAAAAA GGATCAACGT AAAAGAGATC ATCMACTTAG ACTTCTGGAA 180 

GCCCAAAAAA GAAACCAAGA AGTGGTTCTA CGTCGCAAAA CTGAAGAGGT TACGGCTCTT 240 

CGTCGGCAAG TAAGACCCAT GTCAGATAAA GTGGCTGGGA AAGTTACTCG GAAGCTGAGT 300 

TCATCTGATG CACCTGCTCA GGACACAGGT TCCAGTGCAG CTGCTGTCGA AACAGATGCA 360 

TCAAGGACAG GAGCCCAGCA GAAAATGAGA ATTCCTGTGG CGAGAGTCCA GGCCTTACCA 420 

ACGCCGGCAA CAAATGGAAA CAGGAAAAAA TATCAGAGGA AAGGATTGAC TGGCCGAGTG 480 

TTTATTTCCA AGACAGCTCG CATGAAGTGG CAGCTCCTTG AGCGCAGGGT CACAGACATC 540 

ATCATGCAGA AGATGACCAT TTCCAACATG GAGGCAGATA TGAATAGACT CCTCAAGCAA 600 
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CGGGAGGAAC TCACAAAAAG ACGAGAGAAA CTTTCAAAAA GAAGGGAGAA GATAGTCAAG 660 

GAGAATGGAG AGGGAGATAA AAATGTGGCT AATATCAATG AAGAGATGGA GTCACTGACT 720 

GCTAATATCG ATTACATCAA TGACAGTATT TCTGATTGTC AGGCCAACAT AATGCAGATG 780 

GAAGAAGCAA AGGAAGAAGG TGAGACATTG GATGTTACTG CAGTCATTAA TGCCTGCACC 840 

CTTACAGAAG CCCGATACCT GCTAGATCAC TTCCTGTCAA TGGGCATCAA TAAGGGTCTT 900 

CAGGCTGCCC AGAAAGAGGC TCAAATTAAA GTACTGGAAG GTCGACTCAA ACAAACAGAA 960 

ATAACCAGTG CTACCCAAAA CCAGCTCTTA TTCCATATGT TGAAAGAGAA GGCAGAATTA 1020 

AATCCTGAGC TAGATGCTTT ACTAGGCCAT GCTTTACAAG ATCTAGATAG CGTACCATTA 1080 

GAAAATGTAG AGGATAGTAC TGATGAGGAT GCTCCTTTAA ACAGCCCAGG ATCAGAAGGA 1140 

AGCACGCTGT CTTCAGATCT CATGAAGCTT TGTGGTGAAG TGAAACCTAA GAACAAGGCC 1200 

CGAAGGAGAA CCACCACTCA GATGGAATTG CTGTATGCAG ATAGCAGTGA ACTAGCTTCA 1260 

GACACTAGTA CAGGAGATGC CTCCTTGCCT GGCCCTCTCA CACCTGTTGC AGAAGGGCAA 1320 

GAGATTGGAA TGAATACAGA GACAAGTGGT ACTTCTGCTA GGGAAAAAGA GCTCTCTCCC 1380 

CCACCTGGCT TACCTTCTAA GATAGGCAGC ATTTCCAGGC AGTCATCTCT ATCAGAAAAA 1440 

AAAATTCCAG AGCCTTCTCC TGTAACAAGG AGAAAGGCAT ATGAGAAAGC AGAAAAATCA 1500 

AAGGCCAAGG AACAAAAGCA CTCAGATTCT GGAACTTCAG AGGCTAGTCT TTCACCTCCT 1560 

TCTTCCCCAC CAAGCCGGCC CCGTAATGAA CTGAATGTTT TTAATCGTCT TACTGTTTCT 1620 

CAGGGAAACA CATCAGTTCA GCAGGATAAG TCTGATGAAA GTGACTCCTC TCTCTCGGAG 1680 

GTACACAGCA GATCCTCCAG AAGGGGCATA ATCAACCCAT TTCCTGCTTC AAAAGGAATC 1740 

AGAGCTTTTC CACTTCAGTG TATTCACATA GCTGAAGGGC ATACAAAAGC TGTGCTCTGT 1800 

GTGGATTCTA CTGATGATCT CCTCTTCACT GGATCAAAAG ATCGTACTTG TAAAGTATGG 1860 

AATCTGGTGA CTGGGCAGGA AATAATGTCA CTGGGGGGTC ATCCCAACAA TGTCGTGTCT 1920 

GTAAAATACT GTAATTATAC CAGTTTGGTC TTCACTGTAT CAACATCTTA TATTAAGGTG 1980 

TGGGATATCA GAGATTCAGC AAAGTGCATT CGAACACTAA CGTCTTCAGG TCAAGTTACT 2040 

CTTGGAGATG CTTGTTCTGC AAGTACCAGT CGAACAGTAG CTATTCCTTC TGGAGAGAAC 2100 

CAGATCAATC AAATTGCCCT AAACCCAACT GGCACCTTCC TCTATGCTGC TTCTGGAAAT 2160 

GCTGTCAGGA TGTGGGATCT TAAAAGGTTT CAGTCTACAG GAAAGTTAAC AGGACACCTA 2220 

GGCCCTGTTA TGTGCCTTAC TGTGGATCAG ATTTCCAGTG GACAAGATCT AATCATCACT 2280 
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GGCTCCAAGG ATCATTACAT CAAAATGTTT 
AGTCCCACCC ACAATTTTGA ACCCCCTCAT 
GGGGATAACC TATTTAGTGG GTCTAGAGAT 
AAAGACCTTC TTCAGCAAGT TCCAAATGCA 
GTGCCAGACC ACCCAGTTTT GCTCAGTGGC 
ATGGATACTT TTATGCCAGT GGGAGAGATG 
TGTGTTAATT CCACCCACAT TTTTACTGCA 
GCTCGCAATT TGCAAGATGG TCAGATCTCT 
AGTAATTAAA CATGAATGAA GATAGGTTGT 
CTTTATGGAA AATGTTGTCC TGTACTTACT 
AAATATATCT GAATTCAACT GCTGACTATA 
CTGTGTGCTT AGTTTTAAGA TCAACCAATA 
TATTCACACT TTTATTGTGG CTGAATTTTT 
ATTTGAATTA CCAAGACGTC TGCTTTTGTG 
CAGCCCACCA TTAACTCATT CATGTAGTTT 
ATTGAAAATG TTAGTCAAAT TGTGGTTTGC 
ATTCCCAAAT TTCAAAGCTC TTTTAATGTA 
TGATGAATTT TCATAAGGTG GAATTGATCT 
TGTGTTAAAT GTTGGCATTT ACTTGTATTG 
TGCTCAGGAC TAAAATGCTG TTAATTTTTT 
CTGTGATGGG AAACATTGTT GATGTCCTAA 
GAGTTGTTTC TATTATGACC ATTTAATTTT 
TTTTATCTAA TTTTTTATAT GCTGCTAAAT 
TGGTAGCTAT GATGAGAGCT ATTATCATCT 
TTGTATAGAG AGACTATCTT GTGTTGTGCC 
TTTTGCAAAA GGATGTATAA TATTTCTGTC 
AGTTTAATTT TTATGTAGAT GGTATAACAT 
CCTGAAAATA TTTGCTTGTA AATGAAAACT 
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GATGTTACAG AAGGAGCTCT TGGGACTGTG 2340 

TATGATGGCA TAGAAGCACT AACCATTCAA 2400 

AATGGAATCA AGAAATGGGA CTTAACTCAA 2460 

CATAAGGATT GGGTCTGTGC CCTGGGAGTG 2520 

TGCAGAGGGG GCATTTTGAA AGTCTGGAAC 2580 

AAGGGTCATG ATAGTCCTAT CAATGCCATA 2640 

GCTGATGATC GAACTGTGAG AATTTGGAAG 2700 

GACACAGGAG ATCTGGGGGA AGATATTGCC 2760 

AAACTGAATG CTGTGATAAT ACTCTGTATT 2820 

AGGCAAAACG TATGAATCGG ATTAACTGGA 2880 

AATGGTATTC TAATAAAATT GTGTACTATC 2940 

GATATATATC CTACAATTGA TATATTGCTT 3000 

GTGCCTATCT ATAAAACACA CTTTCAAATT 3060 

ACAGTCAGAA AACACACCTG GAATACGATG 3120 

ATTCAAGTGA TTTATGTATT TAAACTAAAT 3180 

TTGTCAGGTA TTTATATCAG TCTGTAGTGG 3240 

ATGGACAAAA ATAAGATATG AGAATATTAT 3300 

TAATCTACTA ACAGAGAAGG GTAGACAGTT 3360 

ACCAAAGTTT TGCAGCTCTA CTATATTCTG 3420 

TTTTTTTTTT TCCAGTGCTG TGCATATATT 3480 

CAGAAATATA TTTTGATCTA TTTTCCTATG 3540 

GTTTTTATTT AATAGTAGTA TTTCCTTCCC 3600 

ATATTTTAAA TATACTATGT TTGCGAACCT 3660 

GTGGTGGGAA AAGCTATGTA AATAGGTAGA 3720 

TGTATGAATT TTTAAAAGTT GTTGACTGGA 3780 

TGCTCAGAAT ATTAATTTGT AAATTCTGCA 3840 

TTGAAAATAT TGTCTTATGT GATTTTTTCC 3900 

TAGCTAGGGC TTAAATAAAC ATGTTGCTAT 3960 
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GAAATKRAAA AAAAAAAAAA AAAARAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 4020 
AARAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA A 4061 
:2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 910 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Met Lys Lys Thr Lys Val Arg Leu Met Lys Gin Met Lys Glu Glu Gin 
15 10 15 

Glu Lys Ala Arg Leu Thr Glu Ser Arg Arg Asn Arg Glu lie Ala Gin 

20 25 30 

Leu Lys Lys Asp Gin Arg Lys Arg Asp His Xaa Leu Arg Leu Leu Glu 
35 40 45 

Ala Gin Lys Arg Asn Gin Glu Val Val Leu Arg Arg Lys Thr Glu Glu 
50 55 60 

Val Thr Ala Leu Arg Arg Gin Val Arg Pro Met Ser Asp Lys Val Ala 
65 70 75 80 

Gly Lys Val Thr Arg Lys Leu Ser Ser Ser Asp Ala Pro Ala Gin Asp 

85 90 95 

Thr Gly Ser Ser Ala Ala Ala Val Glu Thr Asp Ala Ser Arg Thr Gly 

100 105 110 

Ala Gin Gin Lys Met Arg lie Pro Val Ala Arg Val Gin Ala Leu Pro 
115 120 125 

Thr Pro Ala Thr Asn Gly Asn Arg Lys Lys Tyr Gin Arg Lys Gly Leu 
130 135 140 

Thr Gly Arg Val Phe lie Ser Lys Thr Ala Arg Met Lys Trp Gin Leu 
145 150 155 160 

Leu Glu Arg Arg Val Thr Asp lie lie Met Gin Lys Met Thr lie Ser 

165 170 175 

Asn Met Glu Ala Asp Met Asn Arg Leu Leu Lys Gin Arg Glu Glu Leu 

180 185 190 
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Thr Lys Arg Arg Glu Lys Leu Ser Lys Arg Arg Glu Lys He Val Lys 
195 200 205 

Glu Asn Gly Glu Gly Asp Lys Asn Val Ala Asn He Asn Glu Glu Met 
210 215 220 

Glu Ser Leu Thr Ala Asn He Asp Tyr He Asn Asp Ser He Ser Asp 
225 230 235 240 

Cys Gin Ala Asn He Met Gin Met Glu Glu Ala Lys Glu Glu Gly Glu . 

245 250 255 

Thr Leu Asp Val Thr Ala Val He Asn Ala Cys Thr Leu Thr Glu Ala 

260 265 270 

Arg Tyr Leu Leu Asp His Phe Leu Ser Met Gly He Asn Lys Gly Leu 
275 280 285 

Gin Ala Ala Gin Lys Glu Ala Gin He Lys Val Leu Glu Gly Arg Leu 
290 295 300 

Lys Gin Thr Glu He Thr Ser Ala Thr Gin Asn Gin Leu Leu Phe His 
305 310 315 320 

Met Leu Lys Glu Lys Ala Glu Leu Asn Pro Glu Leu Asp Ala Leu Leu 

325 330 335 

Gly His Ala Leu Gin Asp Leu Asp Ser Val Pro Leu Glu Asn Val Glu 

340 345 350 

Asp Ser Thr Asp Glu Asp Ala Pro Leu Asn Ser Pro Gly Ser Glu Gly 
355 360 365 

Ser Thr Leu Ser Ser Asp Leu Met Lys Leu Cys Gly Glu Val Lys Pro 
370 375 380 

Lys Asn Lys Ala Arg Arg Arg Thr Thr Thr Gin Met Glu Leu Leu Tyr 
385 390 395 400 

Ala Asp Ser Ser Glu Leu Ala Ser Asp Thr Ser Thr Gly Asp Ala Ser 

405 410 415 

Leu Pro Gly Pro Leu Thr Pro Val Ala Glu Gly Gin Glu He Gly Met 

420 425 430 

Asn Thr Glu Thr Ser Gly Thr Ser Ala Arg Glu Lys Glu Leu Ser Pro 
435 440 445 

Pro Pro Gly Leu Pro Ser Lys He Gly Ser He Ser Arg Gin Ser Ser 
450 455 460 

Leu Ser Glu Lys Lys He Pro Glu Pro Ser Pro Val Thr Arg Arg Lys 
465 470 475 480 

Ala Tyr Glu Lys Ala Glu Lys Ser Lys Ala Lys Glu Gin Lys His Ser 
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485 490 495 

Asp Ser Gly Thr Ser Glu Ala Ser Leu Ser Pro Pro Ser Ser Pro Pro 

500 505 510 

Ser Arg Pro Arg Asn Glu Leu Asn Val Phe Asn Arg Leu Thr Val Ser 
515 520 525 

Gin Gly Asn Thr Ser Val Gin Gin Asp Lys Ser Asp Glu Ser Asp Ser 
530 535 540 

Ser Leu Ser Glu Val His Ser Arg Ser Ser Arg Arg Gly lie lie Asn 
545 550 555 560 

Pro Phe Pro Ala Ser Lys Gly lie Arg Ala Phe Pro Leu Gin Cys lie 

565 570 575 

His lie Ala Glu Gly His Thr Lys Ala Val Leu Cys Val Asp Ser Thr 

580 585 590 

Asp Asp Leu Leu Phe Thr Gly Ser Lys Asp Arg Thr Cys Lys Val Trp 
595 600 605 

Asn Leu Val Thr Gly Gin Glu lie Met Ser Leu Gly Gly His Pro Asn 
610 615 620 

Asn Val Val Ser Val Lys Tyr Cys Asn Tyr Thr Ser Leu Val Phe Thr 
625 630 635 640 

Val Ser Thr Ser Tyr lie Lys Val Trp Asp lie Arg Asp Ser Ala Lys 

645 65C 655 

Cys lie Arg Thr Leu Thr Ser Ser Gly Gin Val Thr Leu Gly Asp Ala 

660 665 670 

Cys Ser Ala Ser Thr Ser Arg Thr Val Ala lie Pro Ser Gly Glu Asn 
675 680 685 

Gin lie Asn Gin lie Ala Leu Asn Pro Thr Gly Thr Phe Leu Tyr Ala 
690 695 700 

Ala Ser Gly Asn Ala Val Arg Met Trp Asp Leu Lys Arg Phe Gin Ser 
705 710 715 720 

Thr Gly Lys Leu Thr Gly His Leu Gly Pro Val Met Cys Leu Thr Val 

725 730 735 

Asp Gin lie Ser Ser Gly Gin Asp Leu lie lie Thr Gly Ser Lys Asp 

740 745 750 

His Tyr lie Lys Met Phe Asp Val Thr Glu Gly Ala Leu Gly Thr Val 
755 760 765 

Ser Pro Thr His Asn Phe Glu Pro Pro His Tyr Asp Gly lie Glu Ala 
770 775 780 
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Leu Thr lie Gin Gly Asp Asn Leu Phe Ser Gly Ser Arg Asp Asn Gly 
785 790 795 800 

lie Lys Lys Trp Asp Leu Thr Gin Lys Asp Leu Leu Gin Gin Val Pro 

805 810 815 

Asn Ala His Lys Asp Trp Val Cys Ala Leu Gly Val Val Pro Asp His 

820 825 830 

Pro Val Leu Leu Ser Gly Cys Arg Gly Gly lie Leu Lys Val Trp Asn 
835 840 845 

Met Asp Thr Phe Met Pro Val Gly Glu Met Lys Gly His Asp Ser Pro 
850 855 860 

lie Asn Ala lie Cys Val Asn Ser Thr His lie Phe Thr Ala Ala Asp 
865 870 875 880 

Asp Arg Thr Val Arg lie Trp Lys Ala Arg Asn Leu Gin Asp Gly Gin 

885 890 895 

lie Ser Asp Thr Gly Asp Leu Gly Glu Asp lie Ala Ser Asn 

900 905 910 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oigonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
TNCTCGGTTAT ATGGAGGACG AATAGACT 29 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION : /desc = "oligonucleotide" 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CTCTTTATGA GCATGATATG GCTTCAG 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
CNTGCTGCCCG ACTTAAACTG AGAACCAA 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GNGGACAACTC CCATAGTTGA GGTTTGTC 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
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TNGGAAGAGTG ACCCAGGGGC CCAAAGCA 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CNCTGCAAGCG AGTGGTTTTC AGGGCTGT 
(2) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
TNGAGCCAAGG AGAGAGCAGA CAGGCTGA 
(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
ANCCCCCAGAA GTACAAGGAC ATGCCAAG 
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(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CNATAGCATCA GCATCAACAT GGTGACAA 
(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
CTCCATTCTC CTTGACTATC TTCTCCC 
(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
TNCCTGGTGCC ATCATGATTG TATGAGGC 
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What is claimed is: 

1. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:l; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:l from nucleotide 22 to nucleotide 462; 

(c) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone AJ1_1 deposited under accession number 
ATCC 98278; 

(d) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AJ1_1 deposited under accession number ATCC 98278; 

(e) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AJ1_1 deposited under accession number ATCC 
98278; 

(f) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AJ1_1 deposited under accession number ATCC 98278; 

(g) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:2; 

(h) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:2 having biological activity; 

(i) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(f) above; 

(j) a polynucleotide which encodes a species homologue of the protein 
of (g) or (h) above ; and 

(k) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(h). 

2. A composition of claim 1 wherein said polynucleotide is operably linked 
to at least one expression control sequence. 

3. A host cell transformed with a composition of claim 2. 

4. The host cell of claim 3, wherein said cell is a mammalian cell. 
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5. A process for producing a protein encoded by a composition of claim 2, 
which process comprises: 

(a) growing a culture of the host cell of claim 3 in a suitable culture 

medium; and 

(b) purifying said protein from the culture. 

6. A protein produced according to the process of claim 5. 

7. The protein of claim 6 comprising a mature protein. 

8. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:2; 

(b) the amino acid sequence of SEQ ID NO:2 from amino acid 52 to 
amino acid 147; 

(c) fragments of the amino acid sequence of SEQ ID NQ2; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
AJ1_1 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

9. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID NCfcZ 

10. The composition of claim 8, wherein said protein comprises the amino acid 
sequence of SEQ ID N02 from amino acid 52 to amino acid 147. 

11. The composition of claim 8, further comprising a pharmaceutically 
acceptable carrier. 

12. A method for preventing, treating or ameliorating a medical condition 
which comprises administering to a mammalian subject a therapeutically effective amount 
of a composition of claim 11. 
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13. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:l or 
SEQ ID N03. 

14. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:4; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:4 from nucleotide 7 to nucleotide 1647; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO.4 from nucleotide 1 to nucleotide 305; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone AQ73_3 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone AQ73__3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone AQ73_3 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone AQ73_3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NCh5; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID N05 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(g) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in 

15. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 
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(a) 
(b) 



the amino acid sequence of SEQ ID NCh5; 

the amino acid sequence of SEQ ID N05 from amino acid 1 to 




(c) fragments of the amino acid sequence of SEQ ID NO:5; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 



AQ73_3 deposited under accession number ATCC 98278; 
the protein being substantially free from other mammalian proteins. 

16. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:4. 

17. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:6; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:6 from nucleotide 62 to nucleotide 757; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:6 from nucleotide 357 to nucleotide 703; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BG142JL deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BG142_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BG142_1 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BG142_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:7; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:7 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 



95 



WO 98/25962 



PCT/US97/23224 



(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

18. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:7; 

(b) the amino acid sequence of SEQ ID NO:7 from amino acid 184 to 
amino acid 214; 

(c) fragments of the amino acid sequence of SEQ ID NQ7; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BG142JL deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

19. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:6. 

20. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:8; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NCh8 from nucleotide 404 to nucleotide 535; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:8 from nucleotide 1 to nucleotide 666; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone BV66_1 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BV66_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BV66_1 deposited under accession number 
ATCC 98278; 
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(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BV66_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:9; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:9 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

21. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:9; 

(b) the amino acid sequence of SEQ ID NO:9 from amino acid 1 to 
amino acid 38; 

(c) fragments of the amino acid sequence of SEQ ID NO:9; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BV66_1 deposited tinder accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

22. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:8. 

23. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:10; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:10 from nucleotide 1204 to nucleotide 1389; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:10 from nucleotide 881 to nucleotide 1380; 
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(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of done BV291_3 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone BV291_3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone BV291J3 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone BV291_3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO: 11; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:ll having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

24. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO.ll; 

(b) the amino acid sequence of SEQ ID NQ11 from amino acid 1 to 
amino acid 59; 

(c) fragments of the amino acid sequence of SEQ ID NOrll; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
BV291J3 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

25. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:10. 
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26. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:12; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 189 to nucleotide 1115; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:12 from nucleotide 1 to nucleotide 451; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CK201_1 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone GK201_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CK201_1 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CK201_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NQ13; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:13 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aMg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i). 

27. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:13; 

(b) the amino acid sequence of SEQ ID NO:13 from amino acid 1 to 
amino acid 88; 
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(c) fragments of the amino acid sequence of SEQ ID NO:13; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CK201JI deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

28. An isolated gene corresponding to the cDNA sequence of SEQ ID NO.12. 

29. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:14; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 117 to nucleotide 923; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 174 to nucleotide 923; 

(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:14 from nucleotide 1 to nucleotide 316; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CQ331__2 deposited under accession 
number ATCC 98278; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CQ331_2 deposited under accession number ATCC 98278; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CQ331_2 deposited under accession number 
ATCC 98278; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CQ331_2 deposited under accession number ATCC 98278; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NQ15; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:15 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-{h) above; 
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(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

30. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NQ15; 

(b) the amino acid sequence of SEQ ID NO:15 from amino acid 1 to 
amino acid 57; 

(c) fragments of the amino acid sequence of SEQ ID NO:15; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CQ331_2 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

31. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:14. 

32. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:16; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NChl6 from nucleotide 223 to nucleotide 483; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:16 from nucleotide 22 to nucleotide 397; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT550_1 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CT550_1 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of done CT550_1 deposited under accession number 
ATCC 98278; 
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(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CT550_1 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO:17; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO.17 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

(1) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i)- 

33. . A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO.17; 

(b) the amino acid sequence of SEQ ID NO:17 from amino acid 1 to 
amino acid 58; 

(c) fragments of the amino acid sequence of SEQ ID NO:17; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CT550_1 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

34. An isolated gene corresponding to the cDNA sequence of SEQ ID NO:16. 

35. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:18; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 112 to nucleotide 969; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 154 to nucleotide 969; 
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(d) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NO:18 from nucleotide 1 to nucleotide 423; 

(e) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT585_1 deposited under accession 
number ATCC 98278; 

(f) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CT585_1 deposited under accession number ATCC 98278; 

(g) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CT585_1 deposited under accession number 
ATCC 98278; 

(h) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of done CT585 JL deposited under accession number ATCC 98278; 

(i) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NO.19; 

(j) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID NO:19 having biological activity; 

(k) a polynucleotide which is an allelic variant of a polynucleotide of 
(a)-(h) above; 

(1) a polynucleotide which encodes a species homologue of the protein 
of (i) or (j) above ; and 

(m) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(j). 

36. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:19; 

(b) the amino acid sequence of SEQ ID NO:19 from amino acid 1 to 
amino acid 104; 

(c) fragments of the amino acid sequence of SEQ ID NO:19; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CT585J deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

37. An isolated gene corresponding to the cDNA sequence of SEQ ID NQ18. 
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38. A composition comprising an isolated polynucleotide selected from the 
group consisting of: 

(a) a polynucleotide comprising the nucleotide sequence of SEQ ID 

NO:20; 

(b) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NQ20 from nucleotide 37 to nucleotide 2766; 

(c) a polynucleotide comprising the nucleotide sequence of SEQ ID 
NOJ20 from nucleotide 243 to nucleotide 789; 

(d) a polynucleotide comprising the nucleotide sequence of the full- 
length protein coding sequence of clone CT797_3 deposited under accession 
number ATCC 98278; 

(e) a polynucleotide encoding the full-length protein encoded by the 
cDNA insert of clone CT797_3 deposited under accession number ATCC 98278; 

(f) a polynucleotide comprising the nucleotide sequence of the mature 
protein coding sequence of clone CT797_3 deposited under accession number 
ATCC 98278; 

(g) a polynucleotide encoding the mature protein encoded by the 
cDNA insert of clone CT797J3 deposited under accession number ATCC 98278; 

(h) a polynucleotide encoding a protein comprising the amino acid 
sequence of SEQ ID NQ21; 

(i) a polynucleotide encoding a protein comprising a fragment of the 
amino acid sequence of SEQ ID N021 having biological activity; 

(j) a polynucleotide which is an allelic variant of a polynucleotide of 
(aHg) above; 

(k) a polynucleotide which encodes a species homologue of the protein 
of (h) or (i) above ; and 

Q) a polynucleotide capable of hybridizing under stringent conditions 
to any one of the polynucleotides specified in (a)-(i)- 

39. A composition comprising a protein, wherein said protein comprises an 
amino acid sequence selected from the group consisting of: 

(a) the amino acid sequence of SEQ ID NO:21; 

(b) the amino acid sequence of SEQ ID NO:21 from amino acid 75 to 
amino acid 251; 
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(c) fragments of the amino acid sequence of SEQ ID NO:21; and 

(d) the amino acid sequence encoded by the cDNA insert of clone 
CT797_3 deposited under accession number ATCC 98278; 

the protein being substantially free from other mammalian proteins. 

40. An isolated gene corresponding to the cDNA sequence of SEQ ID NCh20. 
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FIGURE 1A 

Hindlll 1 




BamHI 2772 Clal 2666 



Plasmid name: pED6dpc2 
Piasmid size: 5374 bp 

Comments/References: pED6dpc2 is derived from pED6dpc1 by insertion of a new 
polylinker to facilitate cDNA cloning. SST cONAs are cloned between EcoRI and Notl. 
pED vectors are described in Kaufman et a). (1991), NAR 19: 4485-4490. 
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FIGURE IB 



Hindtll 1 




Sapl 2135 



Plasmid name: pNOTs 
Plasmid aiza: 4529 bp 



Commenta/References: pNOTs is a derivative of pMT2 (Kaufman et al.1989. Mol.Cell.Riol.9:1741-t750). 
DHFR was deleted and a new polyfinker was inserted between EcoRI and Hpal. M13 origin 
of replication was inserted in the Ctal site. SST cONAs are cloned between EcoRI and 
Notl 



2/2 

SUBSTITUTE SHEET (RULE 26) 



